1 code implementation • 7 Nov 2023 • Taehee Jeon, BongSeok Yang, ChangHwan Kim, Yoonseob Lim
We introduce a morpheme-aware subword tokenization method that utilizes sub-character decomposition to address the challenges of applying Byte Pair Encoding (BPE) to Korean, a language characterized by its rich morphology and unique writing system.
no code implementations • 7 Jul 2023 • Bruce W. Lee, BongSeok Yang, Jason Hyung-Jong Lee
Though discourse parsing can help multiple NLP fields, there has been no wide language model search done on implicit discourse relation classification.
Discourse Parsing Implicit Discourse Relation Classification +3