• #indieweb 2015-12-02
  • Prev
    Next
  • #indieweb
  • #dev
  • #wordpress
  • #meta
  • #stream
  • #microformats
  • #known
  • #events
#indieweb ≡
  • ←
  • →
2015-12-02 UTC
# 17:48
dhalgren`
kevinmarks_: yeah. and mostly phrase-based. Gets beaten in certain pairs despite its massive data advantage. Like en -> czech is consistently won by a czech system, incorporating lots of old-fashioned linguistic syntactic analysis (tectoML, transfer-based, using ideas of some prague school of syntax), a google-ish phrase-based system and a rules-based "fixer" of common mistakes. Chimera, they call the resulting monstrosity.