Discussion
Sascha Wolfer
@sascha_wolfer@fediscience.org  ·  activity timestamp 15 hours ago

What's your go-to #python or #rstats tool(chain) for splitting #German compounds? I've tried a few but was not really satisfied. Maybe I missed something. #NLP #linguistics

Dr. Tim Schatto-Eckrodt
@Kudusch@social.tchncs.de replied  ·  activity timestamp 14 hours ago

@sascha_wolfer Have you looked into Holmes? It's built on top of #spacy, and I remember it being able to extract tokens from compound words: https://github.com/richardpaulhudson/holmes-extractor

GitHub - richardpaulhudson/holmes-extractor: Information extraction from English and German texts based on predicate logic

