Stemming Algorithms


GRAS: An effective and efficient stemming algorithm for information retrieval JH Paik, M Mitra, SK Parui… – ACM Transactions on …, 2011 – dl.acm.org Abstract: A novel graph-based language-independent stemming algorithm suitable for information retrieval is proposed in this article. The main features of the algorithm are retrieval effectiveness, generality, and computational efficiency. We test our approach on …

The Porter Stemming Algorithm M Porter – 2009 – citeulike.org

Snowball: A language for stemming algorithms, 2001 M Porter – URL http://snowball.tartarus.org/texts/introduction. …, 2009

Stemming Algorithm M Porter – 2010

A novel corpus-based stemming algorithm using co-occurrence statistics JH Paik, D Pal… – Proceedings of the 34th international ACM …, 2011 – dl.acm.org Abstract: We present a stemming algorithm for text retrieval. The algorithm uses the statistics collected on the basis of certain corpus analysis based on the co-occurrence between two word variants. We use a very simple co-occurrence measure that reflects how often a pair …

The ‘Official’ home page for distribution of the Porter Stemming Algorithm M Porter – Website http://www.tartarus.org/~martin/ …, 2008

The Porter stemming algorithm: then and now P Willett – Program: electronic library and information systems, 2006 – emeraldinsight.com Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject …

WCI 02 Improvements on the Porter’s Stemming Algorithm for Portuguese MVB Soares, RC Prati… – Latin America Transactions …, 2009 – ieeexplore.ieee.org Abstract: The amount of textual information digitally stored is growing every day. However, our capability of processing and analyzing that information is not growing at the same pace. To overcome this limitation, it is important to develop semi-automatic processes to extract …

A rule-based Arabic stemming algorithm TMT Sembok, BMA Ata… – … of the 5th European conference on …, 2011 – wseas.us Abstract: Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an effective stemming algorithm for the indexing and retrieval of Arabic …

Evaluation of perstem: a simple and efficient stemming algorithm for Persian A Jadidinejad, F Mahmoudi… – Multilingual Information Access …, 2011 – Springer Persian is a challenging language in the field of NLP. Right-to-left orthography, complex morphology, complicated grammatical rules, and different forms of letters make it an interesting language for NLP research. In this paper we measure the effectiveness of a …

A stemming algorithm for the farsi language K Taghva, R Beckley… – … Technology: Coding and …, 2005 – ieeexplore.ieee.org Abstract: In this paper, we report on the design and implementation of a stemmer for the Farsi language. The results of our evaluation on a small Farsi document collection shows a significant improvement in precision/recall over not stemming.

Strength and similarity of affix removal stemming algorithms WB Frakes… – ACM SIGIR Forum, 2003 – dl.acm.org Abstract: This study evaluated the strength of, and similarity among, four affix removal stemming algorithms. Strength and similarity were evaluated in different ways, including new metrics based on the Hamming distance measure. Data was collected on stemmer outputs …

Using Stemming Algorithms on a Grid Environment V Roncero, M Costa… – High Performance Computing for …, 2008 – Springer Stemming algorithms are commonly used in Information Retrieval with the goal of reducing the number of the words which are in the same morphological variant in a common representation. Stemming analysis is one of the tasks of the pre-processing phase on text …

Two Algorithms for Probabilistic Stemming M Melucci, N Orio – Information Access through Search Engines and …, 2008 – Springer This chapter describes two algorithms for probabilistic stemming. A probabilistic stemmer aims at detecting word stems by using a probabilistic or statistical model with no or very little knowledge about the language for which the stemmer has been built. While illustrating …

Analysis and Algorithms for Stemming Inversion I Feinerer – Information Retrieval Technology, 2010 – Springer Stemming is a fundamental technique for processing large amounts of data in information retrieval and text mining. However, after processing the reversal of this process is often desirable, e.g., for human interpretation, or methods which operate on sequences of …

Overview of Stemming Algorithms I Smirnov – DePaul University, 2008 – the-smirnovs.org This paper is an overview of the state-of-the-art in the area of stemming and lemmatization algorithms. It covers basic ideas of “classical” (affix removal) techniques as well as some recent approaches like stochastic algorithms. The paper scope is restricted by techniques …

Stemming Algorithm to Classify Arabic Documents MAH Omer… – 2010 – Citeseer Abstract: Text classification is the problem of assigning predefined class labels to incoming unclassified documents. Many algorithms and researches have been implemented for English, Chinese and other languages, while there is few researches introduced for …

A new Arabic stemming algorithm ET AlShammari… – Experimental Linguistics ExLing 2008, 2008 – users.uoa.gr Abstract: Text processing is a vital step in the information retrieval process, text mining, and natural language processing. It includes several stages, such as normalization, stop word removal, and stemming. Stemming is the process of reducing the lexicon to its root. Due to …

Evaluation of Lovins Stemming Algorithm in Large Database Systems JLK Serrano – 2008

Python Implementation of Porter Stemming Algorithm V Gupta – Retrieved from: http://tartarus.org/martin/PorterStemmer/ …, 2008

STEMBR: A stemming algorithm for the brazilian portuguese language R Alvares, A Garcia… – Progress in Artificial Intelligence, 2005 – Springer Stemming algorithms have traditionally been utilized in information retrieval systems as they generate a more concise word representation. However, the efficiency of these algorithms varies according to the language they are used with. This paper presents STEMBR, a …

Improving query expansion with stemming terms: A new genetic algorithm approach L Araujo… – Evolutionary Computation in Combinatorial …, 2008 – Springer Nowadays, searching information in the web or in any kind of document collection has become one of the most frequent activities. However, user queries can be formulated in a way that hinder the recovery of the requested information. The objective of automatic …

The Effectiveness of a Graph-Based Algorithm for Stemming M Bacchin, N Ferro… – Digital Libraries: People, Knowledge, …, 2002 – Springer In Information Retrieval (IR), stemming enables a matching of query and document terms which are related to a same meaning but which can appear in different morphological variants. In this paper we will propose and evaluate a statistical graph-based algorithm for …

Is paice method suitable for evaluating Arabic stemming algorithms? HM AlSerhan, S Alqrainy… – Computer Engineering & …, 2008 – ieeexplore.ieee.org Abstract: There are many measurement methodologies used to measure the quality of stemming algorithms and to evaluate their effectiveness. All of these measurement are designed for English language. In this study we trying to check the viability of the Paice …

FindStem: Analysis and evaluation of a Turkish stemming algorithm H Sever… – String Processing and Information Retrieval, 2003 – Springer In this paper, we evaluate the effectiveness of a new stemming algorithm, FINDSTEM, for use with Turkish documents and queries, and compare the use of this algorithm with the other two previously defined Turkish stemmers, namely “AF” and “LM” algorithms. Of them, …

Paice/Husk Stemming Algorithm Implemented Over SP Search Index CJE Bacani – University of the Philippines Los Banos, Laguna. …, 2006

A Stemming Algorithm for Tagalog Words E Bonus – Philippines: De La Salle University, 2003

University of Padua at CLEF 2002: Experiments to evaluate a statistical stemming algorithm M Bacchin, N Ferro… – Proceedings of CLEF, 2002 – Citeseer Abstract: In Information Retrieval (IR), stemming is used to reduce variant word forms to common root. The assumption is that if two words have the same root, then they represent the same concept. Hence stemming permits a IR system to match query and document …

A stemming algorithm for Malay language MT Abdullah, F Ahmad, R Mahmod… – Proceedings of the 4th …, 2005

A generalization of the method for evaluation of stemming algorithms based on error counting R de Madariaga, J del Castillo… – String Processing and …, 2005 – Springer Until the introduction of the method for evaluation of stemming algorithms based on error counting, the effectiveness of these algorithms was compared by determining their retrieval performance for various experimental test collections. With this method, the performance …

Overcoming stiffness in stochastic simulation stemming from partial equilibrium: A multiscale Monte Carlo algorithm A Samant… – The Journal of chemical physics, 2005 – link.aip.org In this paper the problem of stiffness in stochastic simulation of singularly perturbed systems is discussed. Such stiffness arises often from partial equilibrium or quasi-steady-state type of conditions. A multiscale Monte Carlo method is discussed that first assesses whether …

Online Song Search Engine Using Porter Stemming Algorithm for Keyword Matching EC Magallanes – 2008

Rstem: Interface to Snowball implementation of Porter’s word stemming algorithm D Temple Lang – R package version 0.3-1, 2006

A Prospective Study of Stemming Algorithms for Web Text Mining GN Shakarad – Ganpat University Journal of …, 2011 – gnujet.ganpatuniversity.ac.in Abstract: Information Retrieval (IR) is essentially a matter of deciding which documents in a collection should be retrieved to satisfy a user’s need for information. The user’s need for information is represented by a query or profile, and contains one or more search terms, …

Study of stemming algorithms S Kodimala – 2010 – digitalcommons.library.unlv.edu Repository citation: Kodimala, Savitha, “Study of stemming algorithms” (2010). UNLV Theses/Dissertations/Professional Papers/Capstones. Paper 754.

The Porter Stemming Algorithm: Available at: http://www.tartarus.org/martin M Porter – 2006 – PorterStemmer

The Porter Stemming Algorithm official home page MF Porter – 2006

The Lovins stemming algorithm M Porter… – 2004

Word Stemming Algorithms and Retrieval Effectiveness in Malay and Arabic Documents Retrieval Systems TMT Sembok – 2005 – Citeseer Systems (IRS) is generally about understanding of information in the documents concern. The more the system able to understand the contents of documents the more effective will be the retrieval outcomes. But understanding of the contents is a very complex task. …

Evaluation of Paice/Husk Stemming Algorithm in Large Database Systems C Atienza – 2008

An Evaluation of Existing Light Stemming Algorithms for Arabic Keyword Searches BE Rogerson – 2008 – etd.ils.unc.edu Abstract: The field of Information Retrieval recognizes the importance of stemming in improving retrieval effectiveness. This same tool, when applied to searches conducted in the Arabic language, increases the relevancy of documents returned and expands searches …

The Lancaster stemming algorithm R Hooper… – 2005

A Generalization of the Method for Evaluation of Stemming Algorithms Based on Error Counting RM Sanchez, JR Fernández… – Proceedings of SPIRE, 2005

The tagalog stemming algorithm B Bonus – 1st National Natural Language Processing Research …, 2004

The english (porter2) stemming algorithm MF Porter, R Boulton… – Retrieved, 2002

Further Enhancement to the Porter’s Stemming Algorithm F Yamout, R Demachkieh, G Hamdan… – Ulm, September 21, …, 2004 – uni-weimar.de Abstract: Stemming algorithms are used to transform the words in texts into their grammatical root form, and are mainly used to improve the Information Retrieval System’s efficiency. Several algorithms exist with different techniques. The most widely used is the Porter …

A Spanish Stemming Algorithm Implementation in PROLOG and C# DDP Barrenechea – 2006 – ai.uga.edu Abstract: This paper presents two implementations of a spanish stemming algorithm in Prolog and C#. The basis for the implementations is a Porter-like algorithm published by the Snowball Project. Some additions to the original algorithm are proposed and included in …

ST ANS Algorithm for Root Word Stemming S Srinivasan… – Information Technology Journal, 2006 – 198.170.104.138 Abstract: Information Retrieval (IR) is essentially a matter of deciding which documents in a collection should be retrieved to satisfy a user’s need for information. In most cases, morphological variants of words have similar semantic interpretations and can be …

A new stemming algorithm to extract quadri-literal Arabic roots G Kanaan, R Al-Shalabi, JM Jaam… – … : From Theory to …, 2004 – ieeexplore.ieee.org Abstract: Summary form only given. We present a new stemming algorithm to extract quadri-literal Arabic roots. The algorithm starts by excluding the prefixes and checks then the word characters starting from the last letter backward to the first one. A temporary matrix is used …

Searching malay text using stemming algorithm R Saian… – 2004 – JICT

Analysis and evaluation of a Turkish stemming algorithm H Serer… – 10th International Symposium SPIRE, 2003

Strength and similarity of affix removal stemming algorithms WFC Fox… – SIGIR Forum, 2003

A Stemming Algorithm for Tagalog Words. Manila: De La Salle University DE Bonus – 2003 – MS Thesis

Stemming Algorithm to Classify Arabic Documents Marwan Ali H. Omer, Shilong Ma – 2010 – cqvip.com School of Computer Science and Engineering …

Enhancement of rules-application-order (RAO) stemming algorithm based on the first character of Malay word / Edatul Muliana binti Ghazalli G Edatul Muliana – 2005 – eprints.ptar.uitm.edu.my Abstract: Stemming is important thing to improve retrieval effectiveness. Stemming is used to reduce the size of indexing file for relevancy of document retrieval. Stemming is technique to truncate the word into the root word that will reduce vocabulary size and improve recall. …

A language-independent Stemming Algorithm M Bacchin – 2002 – Ph. D. Thesis, Department of …

The analysis and evaluation of stemming algorithms for Turkish H Sever… – 10th International Symposium on String Processing …, 2003

Development of stemming algorithm for wolaytta text L LESSA – 2003 – etd.aau.edu.et Abstract: This study describes the design of a stemming algorithm for Wolaytta language. To give a solid background for the thesis, literatures on conflation in general and stemming algorithms in particular were reviewed. Since it is the nature and characteristics of …

Improved porter’s algorithm for root word stemming M Saravanan, PCR Raj, VS Murthy… – Proc. of International Conference on …, 2002

Developing a word-stemming program using Porter’s Algorithm KV Lakshmi – NCSI minor project report, 2002

Stemming for Complex Medical Spanish Words: Algorithm for multipurpose languages PE Jesus – 2002