Speech and Language Processing (3rd ed. draft) - by Dan Jurafsky and James H. Martin (Stanford):
sentencex - by Wikimedia:
https://github.com/wikimedia/sentencex
A sentence segmentation library with wide language support optimized for speed and utility.
Written in #Rust.
Bindings are available for #Python, #NodeJS and #WASM
Might be useful for my #SpeechToText system! 👀
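For context on why a dedicated library helps here: a naive "split after every period" approach breaks on abbreviations. Below is a rough stdlib-only sketch of that problem. It is not how sentencex works internally and does not use its API; `naive_segment` and the tiny `ABBREVIATIONS` set are made-up names for illustration only.

```python
import re

# Hypothetical abbreviation list, for illustration only. A real library
# ships per-language data instead of a hard-coded set like this.
ABBREVIATIONS = {"dr.", "mr.", "mrs.", "ms.", "prof.", "e.g.", "i.e.", "etc."}


def naive_segment(text: str) -> list[str]:
    """Split after ., ! or ? followed by whitespace, unless the token
    ending at that point is a known abbreviation."""
    sentences = []
    start = 0
    for match in re.finditer(r"[.!?]+\s+", text):
        candidate = text[start:match.end()].strip()
        last_token = candidate.split()[-1].lower()
        if last_token in ABBREVIATIONS:
            continue  # not a real sentence boundary, keep scanning
        sentences.append(candidate)
        start = match.end()
    # Whatever follows the last boundary is the final sentence.
    tail = text[start:].strip()
    if tail:
        sentences.append(tail)
    return sentences


for sentence in naive_segment("Dr. Smith arrived late. The talk had started! Was it recorded?"):
    print(sentence)
```

Even this toy version needs a hand-maintained abbreviation list for a single language, which is the kind of work a library with wide language support takes off your hands.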
For anyone working in #DigitalHumanities: #spaCy is a powerful, tried-and-tested Python #NLP library for processing text. It can identify parts of speech, base forms (lemmas), and sentence structure (dependency parsing), recognize named entities (NER), and more.
1/
Spacy Analyzer - a Hugging Fac...