Side Information Gathering for Mining Text Data

Naveena.M; Karthik.R; Balaji.M

Abstract

In many text mining applications, side information is available along with the text documents. Such side information may be of different kinds, such as document provenance information, the links in the document, user-access behavior from web logs, or other non-textual attributes embedded in the text document. Such attributes may contain a tremendous amount of information for clustering purposes. However, the relative importance of this side information may be difficult to estimate, especially when some of the information is noisy. In such cases, it can be risky to incorporate side information into the mining process, because it can either improve the quality of the representation for mining or add noise to the process. Therefore, we need a principled way to perform the mining process so as to maximize the advantages of using this side information. In this paper, we design an algorithm that combines classical partitioning algorithms with probabilistic models in order to create an effective clustering approach. We then show how to extend the approach to the classification problem. We present experimental results on a number of real data sets to illustrate the advantages of using such an approach.

International Journal of Innovative Research in Computer and Communication Engineering
