Japanese multi-task CNN trained on UD-Japanese BCCWJ r2.8 + GSK2014-A(2019). Assigns word2vec token vectors. Components: tok2vec, parser, ner, morphologizer, attribute_ruler, compound_splitter, bunsetu_recognizer. | |
https://github.com/megagonlabs/ginza | |
ja-ginza-5.2.0 | MIT |
download ~amd64 ~x86 | pypi |