International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 52 - Number 13 |
Year of Publication: 2012 |
Authors: J. I. Sheeba, K. Vivekanandan |
10.5120/8260-1800 |
J. I. Sheeba, K. Vivekanandan . Improved Keyword and Keyphrase Extraction from Meeting Transcripts. International Journal of Computer Applications. 52, 13 ( August 2012), 11-15. DOI=10.5120/8260-1800
Keywords play a vital role in extracting the correct information as per user requirements. Keywords are like index terms that contain the most important information about the content of the document. Keyword Extraction is the task of identifying a keyword or keyphrase from a document that can help users easily to understand the documents . Meeting transcripts is significantly different from document or other speech domains. This paper aims to extract keywords and keyphrases from meeting transcripts and also to add some additional features for improving the keyword and keyphrase extraction method . Here, this method is performed by both human transcripts and ASR transcripts and the keywords are extracted through MaxEnt and SVM classifier and Extraction of bigram and trigram keywords retrieval using N-gram based approach efficiently and also to identify the low frequency keywords using LDA (Latent Dirichlet Approach). Finally, the quality of the Extracted keywords is improved using pattern features through sequential pattern mining.