Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
161 | Eiman Tamah Al-Shammari, Jessica Lin 0001 |
A novel Arabic lemmatization algorithm. |
AND |
2008 |
DBLP DOI BibTeX RDF |
text mining, tokenization, stemming, Arabic, lemmatization |
142 | Tuomo Korenius, Jorma Laurikkala, Kalervo Järvelin, Martti Juhola |
Stemming and lemmatization in the clustering of finnish text documents. |
CIKM |
2004 |
DBLP DOI BibTeX RDF |
clustering, normalization, stemming, lemmatization |
98 | Jakub Kanis, Ludek Müller |
Automatic Lemmatizer Construction with Focus on OOV Words Lemmatization. |
TSD |
2005 |
DBLP DOI BibTeX RDF |
|
79 | Iraide Zipitria, Ana Arruarte Lasa, Jon A. Elorriaga |
Observing Lemmatization Effect in LSA Coherence and Comprehension Grading of Learner Summaries. |
Intelligent Tutoring Systems |
2006 |
DBLP DOI BibTeX RDF |
|
79 | Jakub Kanis, Ludek Müller |
Using the Lemmatization Technique for Phonetic Transcription in Text-to-Speech System. |
TSD |
2004 |
DBLP DOI BibTeX RDF |
|
76 | Kimmo Kettunen 0001, Eija Airio, Kalervo Järvelin |
Restricted inflectional form generation in management of morphological keyword variation. |
Inf. Retr. |
2007 |
DBLP DOI BibTeX RDF |
Best-match IR, Inflected indexes, Frequent case form generation for keywords, Generative methods in management of keyword variation |
66 | Anton Karl Ingason, Sigrún Helgadóttir, Hrafn Loftsson, Eiríkur Rögnvaldsson |
A Mixed Method Lemmatization Algorithm Using a Hierarchy of Linguistic Identities (HOLI). |
GoTAL |
2008 |
DBLP DOI BibTeX RDF |
lemma, BLARK, Icelandic, Lemmald, IceTagger, machine learning, normalization, lemmatization |
66 | Fotis Lazarinis |
Lemmatization and stopword elimination in Greek web searching. |
EATIS |
2007 |
DBLP DOI BibTeX RDF |
stopword, information retrieval, search engines, stemming, lemmatization, Greek |
63 | Eija Airio |
Word normalization and decompounding in mono- and bilingual IR. |
Inf. Retr. |
2006 |
DBLP DOI BibTeX RDF |
Monolingual information retrieval, bilingual information retrieval, decompounding, stemming, lemmatization |
44 | Tim vor der Brück, Steffen Eger, Alexander Mehler |
Lexicon-assisted tagging and lemmatization in Latin: A comparison of six taggers and two lemmatization models. |
LaTeCH@ACL |
2015 |
DBLP DOI BibTeX RDF |
|
44 | Henrik Holmboe |
Lemmatisering - hvilke af de ideelle krav til lemmatisering er opfyldelige eller opfyldte? (Lemmatization - which of the ideal requirements of lemmatization are fulfillable or fulfilled?) [In Danish]. |
NODALIDA |
1979 |
DBLP BibTeX RDF |
|
26 | Eiman Tamah Al-Shammari, Jessica Lin 0001 |
Towards an error-free Arabic stemming. |
CIKM-iNEWS |
2008 |
DBLP DOI BibTeX RDF |
text mining, tokenization, stemming, Arabic, lemmatization |
26 | Zdenek Ceska, Michal Toman, Karel Jezek |
Multilingual Plagiarism Detection. |
AIMSA |
2008 |
DBLP DOI BibTeX RDF |
EuroWordNet, Nature Language Processing, Thesaurus, Plagiarism, Copy Detection, Lemmatization |
26 | Dan Tufis, Dan Stefanescu, Radu Ion, Alexandru Ceausu |
RACAI's Question Answering System at QA@CLEF2007. |
CLEF |
2007 |
DBLP DOI BibTeX RDF |
tagging, question answering, tokenization, query generation, lemmatization, answer extraction, Lucene, indexing and retrieval |
26 | Petr Strossa |
Machine assisted translation from English to a Slavic language. |
Mach. Transl. |
1994 |
DBLP DOI BibTeX RDF |
dictionary databases, lemmatization (stemming) rules, machine assisted translation, Slavic languages, efficiency, English, Czech |
22 | Olia Toporkov, Rodrigo Agerri |
Evaluating Shortest Edit Script Methods for Contextual Lemmatization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
22 | Olia Toporkov, Rodrigo Agerri |
On the Role of Morphological Information for Contextual Lemmatization. |
Comput. Linguistics |
2024 |
DBLP DOI BibTeX RDF |
|
22 | Olia Toporkov, Rodrigo Agerri |
On the Role of Morphological Information for Contextual Lemmatization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Dmitriy Kalugin-Balashov |
Advancing Full-Text Search Lemmatization Techniques with Paradigm Retrieval from OpenCorpora. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Lucca de Freitas Santos, Murilo Varges da Silva |
The effect of stemming and lemmatization on Portuguese fake news text classification. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Gabriela Palka, Artur Nowakowski |
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Shafie Abdi Mohamed, Muhidin Abdullahi Mohamed |
Lexicon and Rule-based Word Lemmatization Approach for the Somali Language. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Péter Berkecz, György Orosz, Zsolt Szántó, Gergo Szabó, Richárd Farkas |
Hybrid lemmatization in HuSpaCy. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Aleksandra Miletic, Janine Siewert |
Lemmatization Experiments on Two Low-Resourced Languages: Low Saxon and Occitan. |
VarDial@EACL |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Liu Jun Yoon, Xuan Yi Tan, Khai Yin Lim, Chi Wee Tan, Ling Ern Cheng, Jenny Tan |
A Comparative Study of Lemmatization Approaches for Rojak Language. |
DaSET |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Aleksei Dorkin, Kairit Sirts |
Comparison of Current Approaches to Lemmatization: A Case Study in Estonian. |
NoDaLiDa |
2023 |
DBLP BibTeX RDF |
|
22 | Shafie Abdi Mohamed, Muhidin A. Mohamed |
Lexicon and Rule-based Word Lemmatization Approach for Somali Language. |
AfricaNLP |
2023 |
DBLP BibTeX RDF |
|
22 | Tomás Jelínek |
Morphological Tagging and Lemmatization of Spoken Corpora of Czech. |
TSD |
2023 |
DBLP DOI BibTeX RDF |
|
22 | Maksud Sharipov, Ogabek Sobirov |
Development of a rule-based lemmatization algorithm through Finite State Machine for Uzbek language. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
22 | Maksud Sharipov, Ogabek Sobirov |
Development of a Rule-based Lemmatization Algorithm Through Finite State Machine for Uzbek Language. |
ALTNLP |
2022 |
DBLP BibTeX RDF |
|
22 | Wilbert Heeringa, Gosse Bouma, Martha Hofman, Jelle Brouwer, Eduard Drenth, Jan Wijffels, Hans Van de Velde |
PoS Tagging, Lemmatization and Dependency Parsing of West Frisian. |
LREC |
2022 |
DBLP BibTeX RDF |
|
22 | Hannah Morcos, Geoffroy Noël, Marcus Husar |
Lemmatization in the collaborative editorial workflow of a medieval French text: The digital edition of the Histoire ancienne jusqu'à César. |
Digit. Scholarsh. Humanit. |
2021 |
DBLP DOI BibTeX RDF |
|
22 | Wilbert Heeringa, Gosse Bouma, Martha Hofman, Eduard Drenth, Jan Wijffels, Hans Van de Velde |
POS tagging, lemmatization and dependency parsing of West Frisian. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
22 | Mika Hämäläinen, Niko Partanen, Khalid Al-Najjar |
Lemmatization of Historical Old Literary Finnish Texts in Modern Orthography. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
22 | Kirill Milintsevich, Kairit Sirts |
Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
22 | Kirill Milintsevich, Kairit Sirts |
Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources. |
EACL |
2021 |
DBLP DOI BibTeX RDF |
|
22 | Cristina Holgado, Alexei Lavrentiev, Mathieu Constant |
Évaluation de méthodes et d'outils pour la lemmatisation automatique du français médiéval (Evaluation of methods and tools for automatic lemmatization in Old French). |
TALN (1) |
2021 |
DBLP BibTeX RDF |
|
22 | Mika Hämäläinen, Niko Partanen, Khalid Al-Najjar |
Lemmatization of Historical Old Literary Finnish Texts in Modern Orthography. |
TALN (1) |
2021 |
DBLP BibTeX RDF |
|
22 | Maria Nutu |
Deep Learning Approach for Automatic Romanian Lemmatization. |
KES |
2021 |
DBLP DOI BibTeX RDF |
|
22 | Iskander Akhmetov, Alexandr A. Pak, Irina Ualiyeva, Alexander F. Gelbukh |
Highly Language-Independent Word Lemmatization Using a Machine-Learning Classifier. |
Computación y Sistemas |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Francesco Camastra, Gennaro Razi |
Italian Text Categorization with Lemmatization and Support Vector Machines. |
Neural Approaches to Dynamics of Signal Exchanges |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Kirill Milintsevich, Kairit Sirts |
Lexicon-Enhanced Neural Lemmatization for Estonian. |
Baltic HLT |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Do Viet Quan, Phan Duy Hung |
Application of Customized Term Frequency-Inverse Document Frequency for Vietnamese Document Classification in Place of Lemmatization. |
ICO |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Aman Priyanshu, Vedant Rishi Das, Shashank Rajiv Moghe, Harsh Rathod, Sai Sravan Medicherla, Mini Shail Chhabra, Sarthak Shastri |
Stance Classification with Improved Elementary Classifiers Using Lemmatization (Grand Challenge). |
BigMM |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Nasser Zalmout, Nizar Habash |
Utilizing Subword Entities in Character-Level Sequence-to-Sequence Lemmatization Models. |
COLING |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Thomas Proisl, Natalie Dykes, Philipp Heinrich, Besim Kabashi, Andreas Blombach, Stefan Evert |
EmpiriST Corpus 2.0: Adding Manual Normalization, Lemmatization and Semantic Tagging to a German Web and CMC Corpus. |
LREC |
2020 |
DBLP BibTeX RDF |
|
22 | Ranka Stankovic, Branislava Sandrih, Cvetana Krstev, Milos Utvic, Mihailo Skoric |
Machine Learning and Deep Neural Network-Based Lemmatization and Morphosyntactic Tagging for Serbian. |
LREC |
2020 |
DBLP BibTeX RDF |
|
22 | Nasser Zalmout, Nizar Habash |
Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging. |
ACL |
2020 |
DBLP DOI BibTeX RDF |
|
22 | Rüdiger Gleim, Steffen Eger, Alexander Mehler, Tolga Uslu, Wahed Hemati, Andy Lücking, Alexander Henlein, Sven Kahlsdorf, Armin Hoenen |
Practitioner's view: A comparison and a survey of lemmatization and morphological tagging in German and Latin. |
J. Lang. Model. |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Mohamed Boudchiche, Azzeddine Mazroui |
A hybrid approach for Arabic lemmatization. |
Int. J. Speech Technol. |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Nasser Zalmout, Nizar Habash |
Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Toms Bergmanis, Sharon Goldwater |
Data Augmentation for Context-Sensitive Neural Lemmatization Using Inflection Tables and Raw Text. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Rudolf Rosa, Zdenek Zabokrtský |
Unsupervised Lemmatization as Embeddings-Based Word Clustering. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Enrique Manjavacas, Ákos Kádár, Mike Kestemont |
Improving Lemmatization of Non-Standard Languages with Joint Learning. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Milan Straka, Jana Straková, Jan Hajic |
Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Chaitanya Malaviya, Shijie Wu, Ryan Cotterell |
A Simple Joint Model for Improved Contextual Neural Lemmatization. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Milan Straka, Jana Straková, Jan Hajic |
Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Nelda Kote, Marenglen Biba, Jenna Kanerva, Samuel Rönnqvist, Filip Ginter |
Morphological Tagging and Lemmatization of Albanian: A Manually Annotated Corpus and Neural Models. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
22 | Enrique Manjavacas, Ákos Kádár, Mike Kestemont |
Improving Lemmatization of Non-Standard Languages with Joint Learning. |
NAACL-HLT (1) |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Chaitanya Malaviya, Shijie Wu, Ryan Cotterell |
A Simple Joint Model for Improved Contextual Neural Lemmatization. |
NAACL-HLT (1) |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Abhisek Chakrabarty, Akshay Chaturvedi, Utpal Garain |
CNN-based Context Sensitive Lemmatization. |
COMAD/CODS |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Francesco Mambrini, Marco Passarotti |
Harmonizing Different Lemmatization Strategies for Building a Knowledge Base of Linguistic Resources for Latin. |
LAW@ACL |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Marine Schmitt, Mathieu Constant |
Neural Lemmatization of Multiword Expressions. |
MWE-WN@ACL |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Christian Wartena |
A Probabilistic Morphology Model for German Lemmatization. |
KONVENS |
2019 |
DBLP BibTeX RDF |
|
22 | Marco Vassallo, Giuliano Gabrieli, Valerio Basile, Cristina Bosco |
The Tenuousness of Lemmatization in Lexicon-based Sentiment Analysis. |
CLiC-it |
2019 |
DBLP BibTeX RDF |
|
22 | Milan Straka, Jana Straková, Jan Hajic |
Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER. |
TSD |
2019 |
DBLP DOI BibTeX RDF |
|
22 | Gor Arakelyan, Karen Hambardzumyan, Hrant Khachatrian |
Towards JointUD: Part-of-speech Tagging and Lemmatization using Recurrent Neural Networks. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
22 | Attila Novák, Borbála Novák |
Identification of Lemmatization Errors Using Neural Models. |
CICLing (1) |
2018 |
DBLP DOI BibTeX RDF |
|
22 | Toms Bergmanis, Sharon Goldwater |
Context Sensitive Neural Lemmatization with Lematus. |
NAACL-HLT |
2018 |
DBLP DOI BibTeX RDF |
|
22 | Giorgio Maria Di Nunzio, Federica Vezzani |
A Linguistic Failure Analysis of Classification of Medical Publications: A Study on Stemming vs Lemmatization. |
CLiC-it |
2018 |
DBLP BibTeX RDF |
|
22 | Gor Arakelyan, Karen Hambardzumyan, Hrant Khachatrian |
Towards JointUD: Part-of-speech Tagging and Lemmatization using Recurrent Neural Networks. |
CoNLL Shared Task (2) |
2018 |
DBLP DOI BibTeX RDF |
|
22 | Abed Alhakim Freihat, Mourad Abbas, Gábor Bella, Fausto Giunchiglia |
Towards an Optimal Solution to Lemmatization in Arabic. |
ACLING |
2018 |
DBLP DOI BibTeX RDF |
|
22 | Samir Amri, Lahbib Zenkouar |
Stemming and Lemmatization for Information Retrieval Systems in Amazigh Language. |
BDCA |
2018 |
DBLP DOI BibTeX RDF |
|
22 | Hamdy Mubarak |
Build Fast and Accurate Lemmatization for Arabic. |
LREC |
2018 |
DBLP BibTeX RDF |
|
22 | Patrick J. Burns |
Backoff Lemmatization as a Philological Method. |
DH |
2018 |
DBLP BibTeX RDF |
|
22 | Mike Kestemont, Guy De Pauw, Renske van Nie, Walter Daelemans |
Lemmatization for variation-rich languages using deep learning. |
Digit. Scholarsh. Humanit. |
2017 |
DBLP DOI BibTeX RDF |
|
22 | Hamdy Mubarak |
Build Fast and Accurate Lemmatization for Arabic. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
22 | Michal Marcinczuk |
Lemmatization of Multi-word Common Noun Phrases and Named Entities in Polish. |
RANLP |
2017 |
DBLP DOI BibTeX RDF |
|
22 | Abhisek Chakrabarty, Onkar Arun Pandit, Utpal Garain |
Context Sensitive Lemmatization Using Two Successive Bidirectional Gated Recurrent Networks. |
ACL (1) |
2017 |
DBLP DOI BibTeX RDF |
|
22 | Miikka Silfverberg, Teemu Ruokolainen, Krister Lindén, Mikko Kurimo |
FinnPos: an open-source morphological tagging and lemmatization toolkit for Finnish. |
Lang. Resour. Evaluation |
2016 |
DBLP DOI BibTeX RDF |
|
22 | Fabian Barteld, Ingrid Schröder, Heike Zinsmeister |
Dealing with word-internal modification and spelling variation in data-driven lemmatization. |
LaTeCH@ACL |
2016 |
DBLP DOI BibTeX RDF |
|
22 | Mohammed Attia, Ayah Zirikly, Mona T. Diab |
The Power of Language Music: Arabic Lemmatization through Patterns. |
CogALex@COLING |
2016 |
DBLP BibTeX RDF |
|
22 | Rogerio Bonatti, Arthur G. de Paula, Victor S. Lamarca, Fábio Gagliardi Cozman |
Effect of Part-of-Speech and Lemmatization Filtering in Email Classification for Automatic Reply. |
AAAI Workshop: Knowledge Extraction from Text |
2016 |
DBLP BibTeX RDF |
|
22 | Reto Baumgartner |
Morphological analysis and lemmatization for Swiss German using weighted transducers. |
KONVENS |
2016 |
DBLP BibTeX RDF |
|
22 | Michael Percillier |
Verb lemmatization and semantic verb classes in a Middle English corpus. |
KONVENS |
2016 |
DBLP BibTeX RDF |
|
22 | Enis Arslan, Umut Orhan |
Graph-based lemmatization of Turkish words by using morphological similarity. |
INISTA |
2016 |
DBLP DOI BibTeX RDF |
|
22 | Ladislav Gallay, Marián Simko |
Utilizing Vector Models for Automatic Text Lemmatization. |
SOFSEM |
2016 |
DBLP DOI BibTeX RDF |
|
22 | Steffen Eger, Rüdiger Gleim, Alexander Mehler |
Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art. |
LREC |
2016 |
DBLP BibTeX RDF |
|
22 | Ranka Stankovic, Cvetana Krstev, Ivan Obradovic, Biljana Lazic, Aleksandra Trtovac |
Rule-based Automatic Multi-word Term Extraction and Lemmatization. |
LREC |
2016 |
DBLP BibTeX RDF |
|
22 | Garrett Nicolai, Grzegorz Kondrak |
Leveraging Inflection Tables for Stemming and Lemmatization. |
ACL (1) |
2016 |
DBLP DOI BibTeX RDF |
|
22 | Jacek Malyszko, Witold Abramowicz, Agata Filipowska, Tomasz Wagner |
Lemmatization of Multi-Word Entity Names for Polish Language Using Rules Automatically Generated Based on the Corpus Analysis. |
LTC |
2015 |
DBLP DOI BibTeX RDF |
|
22 | Yudong Liu, Clinton Burkhart, James Hearne, Liang Luo |
Enhancing Sumerian Lemmatization by Unsupervised Named-Entity Recognition. |
HLT-NAACL |
2015 |
DBLP DOI BibTeX RDF |
|
22 | Thomas Müller 0009, Ryan Cotterell, Alexander M. Fraser, Hinrich Schütze |
Joint Lemmatization and Morphological Tagging with Lemming. |
EMNLP |
2015 |
DBLP DOI BibTeX RDF |
|
22 | Cezary Kaliszyk, Josef Urban, Jirí Vyskocil |
Lemmatization for Stronger Reasoning in Large Theories. |
FroCos |
2015 |
DBLP DOI BibTeX RDF |
|
22 | Derwin Suhartono, David Christiandy, Rolando Rolando |
Lemmatization Technique in Bahasa: Indonesian Language. |
J. Softw. |
2014 |
DBLP DOI BibTeX RDF |
|
22 | Karel Kucera, Martin Stluka |
Data processing and lemmatization in digitized 19th-century Czech texts. |
DATeCH |
2014 |
DBLP DOI BibTeX RDF |
|
22 | Mary Ting, Rabiah Abdul Kadir, Tengku Mohd Tengku Sembok, Fatimah Ahmad, Azreen Azman |
Adaptive Learning for Lemmatization in Morphology Analysis. |
SoMeT (Selected Papers) |
2014 |
DBLP DOI BibTeX RDF |
|
22 | Mary Ting, Rabiah Abdul Kadir, Tengku Mohd Tengku Sembok, Fatimah Ahmad, Azreen Azman |
Adaptive Learning for Lemmatization in Morphology Analysis. |
SoMeT |
2014 |
DBLP DOI BibTeX RDF |
|
22 | Michal Konkol, Miloslav Konopík |
Named Entity Recognition for Highly Inflectional Languages: Effects of Various Lemmatization and Stemming Approaches. |
TSD |
2014 |
DBLP DOI BibTeX RDF |
|
22 | Anand Kumar M, K. P. Soman |
AMRITA_CEN@FIRE-2014: Morpheme Extraction and Lemmatization for Tamil using Machine Learning. |
FIRE |
2014 |
DBLP DOI BibTeX RDF |
|