1
0
mirror of https://github.com/lorien/awesome-web-scraping.git synced 2024-11-28 08:48:58 +02:00

Merge pull request #99 from TheWoops/patch-3

Added 3 popular NLP-libraries
This commit is contained in:
Gregory Petukhov 2020-01-14 21:37:07 +03:00 committed by GitHub
commit 47c1f271a8
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -254,6 +254,9 @@ Libraries for parsing and manipulating specific text formats.
Libraries for working with human languages.
* [NLTK](http://www.nltk.org/) - A leading platform for building Python programs to work with human language data.
* [spacy](https://github.com/explosion/spaCy) - Enables using State-of-the-Art Deep Learning models for common NLP tasks.
* [fastai](https://github.com/fastai/fastai) - Deep Learning library with free video tutorials + active forum community, downside of lib: GPU needed
* [gensim](https://github.com/RaRe-Technologies/gensim) - library for topic modeling, document indexing and similarity retrieval with large corpora
* [Pattern](http://www.clips.ua.ac.be/pattern) - A web mining module for the Python. It has tools for natural language processing, machine learning, among others.
* [TextBlob](http://textblob.readthedocs.org/) - Providing a consistent API for diving into common NLP tasks. Stands on the giant shoulders of NLTK and Pattern.
* [jieba](https://github.com/fxsjy/jieba) - Chinese Words Segmentation Utilities.