Search for dissertations about: "document clustering"

Showing result 1 - 5 of 7 swedish dissertations containing the words document clustering.

  1. 1. Clustering in Swedish : The Impact of some Properties of the Swedish Language on Document Clustering and an Evaluation Method

    Author : Magnus Rosell; Viggo Kann; Björn Levin; KTH; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; Document Clustering; Language technology; Språkteknologi;

    Abstract : Text clustering divides a set of texts into groups, so that texts within each group are similar in content. It may be used to uncover the structure and content of unknown text sets as well as to give new perspectives on known ones. READ MORE

  2. 2. Deep learning for news topic identification in limited supervision and unsupervised settings

    Author : Arezoo Hatefi; Frank Drewes; Johanna Björklund; Xuan-Son Vu; Eric Gaussier; Umeå universitet; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; Topic Identification; Data Clustering; News Stream Clustering; Semi-Supervised Learning; Unsupervised Learning; Event Topics; News Stories; Multimodal News; Document Classification; Document Clustering; Deep Learning; Deep Clustering; Pre-trained Language Models;

    Abstract : In today's world, following news is crucial for decision-making and staying informed. With the growing volume of daily news, automated processing is essential for timely insights and in aiding individuals and corporations in navigating the complexities of the information society. READ MORE

  3. 3. Multi-Document Summarization and Semantic Relatedness

    Author : Olof Mogren; Chalmers tekniska högskola; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; semantic relatedness; multi-document summarization; semantic similarity; automatic summarization;

    Abstract : Automatic summarization is the process of presenting the contents of written documents in a short, comprehensive fashion. Many approaches have been proposed for this problem, some of which extract content from the input documents (extractive methods), and others that generate the language in the summary based on some representation of the document contents (abstractive methods). READ MORE

  4. 4. Automated subject classification of textual web pages, for browsing

    Author : Koraljka Golub; Institutionen för elektro- och informationsteknik; []
    Keywords : TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; automated classification; subject browsing; structural Web-page elements; Web page classification; document clustering; bibliographic coupling; text categorization; Subject classification;

    Abstract : With the exponential growth of the World Wide Web, automated subject classification of Web pages has become a major research issue in information and computer sciences. Organizing Web pages into a hierarchical structure for subject browsing is gaining more recognition as an important tool in information-seeking processes. READ MORE

  5. 5. Gossip-based Algorithms for Information Dissemination and Graph Clustering

    Author : Fatemeh Rahimian; Seif Haridi; Eiko Yoneki; KTH; []
    Keywords : TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES;

    Abstract : Decentralized algorithms are becoming ever more prevalent in almost all real-world applications that are either data intensive, computation intensive or both. This thesis presents a few decentralized solutions for large-scale (i) data dissemination, (ii) graph partitioning, and (iii) data disambiguation. READ MORE