update lists

This commit is contained in:
2025-07-18 22:22:32 +02:00
parent 55bed3b4a1
commit 5916c5c074
3078 changed files with 331679 additions and 357255 deletions

View File

@@ -30,9 +30,10 @@ A curated list of anything remotely related to linguistics, sorted in alphabetic
* [Natural Language ToolKit (NLTK)](http://www.nltk.org/) - The most complete platform for building Python programs to work with human language data.
* [Snowball](https://snowballstem.org/) - Snowball is a language in which stemming algorithms can be easily represented.
* [Spacy](https://spacy.io/) - Industrial-strength National Language Processing in Python.
* [Mate Tools](http://hdl.handle.net/11022/1007-0000-0000-8E4E-A), webservice via [WebLicht](https://weblicht.sfs.uni-tuebingen.de/)
* [Mate Tools](http://hdl.handle.net/11022/1007-0000-0000-8E4E-A), webservice via WebLicht
* [UBIAI](https://ubiai.tools/) - Easy-to-use text annotation tool for teams with most comprehensive auto-annotation features. Supports NER, relations and document classification as well as OCR annotation for invoice labeling.
* [textblob-de](https://github.com/markuskiller/textblob-de) - Nice alternative for spacy (see above).
* [tyo](https://github.com/mongsvo/tyo) - A utility for finding Typo-Bridges.
* [UralicNLP](https://github.com/mikahama/uralicNLP) - An open source Python library for processing morphologically rich and, for the most part, endangered Uralic languages. It can do morphological analysis, generation, lemmatization, disambiguation and lexical lookup for a great many Uralic languages.
### Algorithms
@@ -55,7 +56,6 @@ A curated list of anything remotely related to linguistics, sorted in alphabetic
* [OpinionSpam](https://github.com/hdaSprachtechnologie/OpinionSpam)
### Resources
* [How To Label Data](https://www.lighttag.io/how-to-label-data/) - Guide on managing large scale linguistic annotation projects.
* [Low Resource Languages](https://github.com/RIchardLitt/low-resource-languages) - A list of resources for conservation, development, and documentation of low resource (human) languages.
* [Language Science Press](https://langsci-press.org/) - Language Science Press is a born-digital scholar-led open access publisher in linguistics.
@@ -126,8 +126,10 @@ A curated list of anything remotely related to linguistics, sorted in alphabetic
* [awesome-nlp-polish](https://github.com/ksopyla/awesome-nlp-polish)
* [awesome-spanish-nlp](https://github.com/dav009/awesome-spanish-nlp)
* [M. Weisser's list of NLP/Computational Linguistics Resources](https://martinweisser.org/corpora_site/comp_ling_resources.html)
* [NLP tools (Saarland University)](https://www.coli.uni-saarland.de/~csporled/page.php?id=tools)
### Communities
* [Linguistics Stack Exchange](https://linguistics.stackexchange.com/)
* [Untranslatable.co, Multilingual urban dictionary](https://untranslatable.co/)
[linguistics.md Github](https://github.com/theimpossibleastronaut/awesome-linguistics
)