What‘s your go-to #python or #rstats tool(chain) for splitting #German compounds? I‘ve tried a few but was not really satisfied. Maybe I missed something. #NLP #linguistics
What‘s your go-to #python or #rstats tool(chain) for splitting #German compounds? I‘ve tried a few but was not really satisfied. Maybe I missed something. #NLP #linguistics
@sascha_wolfer Have you looked into Holmes? It’s build on top of #spacy and I remember it being able to extract tokens from compound words: https://github.com/richardpaulhudson/holmes-extractor