An Empirical Study of Korean Sentence Representation with Various Tokenizations

An Empirical Study of Korean Sentence Representation with Various Tokenizations

It is important how the token unit is defined in a sentence in natural language process tasks, such as text classification, machine translation, and generation. Many studies recently utilized the subword tokenization in language models such as BERT, KoBERT, and ALBERT. Although these language models...

Full description

Bibliographic Details
Main Authors:	Danbi Cho, Hyunyoung Lee, Seungshik Kang
Format:	Article
Language:	English
Published:	MDPI AG 2021-04-01
Series:	Electronics
Subjects:	Korean sentence embedding subword tokenization sentiment analysis
Online Access:	https://www.mdpi.com/2079-9292/10/7/845

Similar Items

Ancient Korean Neural Machine Translation
by: Chanjun Park, et al.
Published: (2020-01-01)

Tokenization of Assets: Security Tokens in Liechtenstein and Switzerland
by: Angelika K. Layr
Published: (2021-09-01)

Token Phenomenon in Participatory Architectural Design and Sulukule Urban Transformation as a Tokenism Example
by: Baharak Fareghi Bavilolyaei, et al.
Published: (2018-07-01)

Word and Relation Embedding for Sentence Representation
Published: (2017)

Low-Power Embedded DSP Core for Communication Systems
by: Tsao Ya-Lan, et al.
Published: (2003-01-01)

Improving Sentence Representations via Component Focusing
by: Xiaoya Yin, et al.
Published: (2020-02-01)

Evaluation of Sentence Representations in Semantic Text Similarity Tasks
by: Balzar Ekenbäck, Nils
Published: (2021)

Repetitive subwords
by: Fazekas, Szilard Zsolt
Published: (2010)

Token Money or Cryptocurrency: technological Content and Economic Essence
by: A. V.  Varnavskiy
Published: (2018-11-01)

Improving Document-Level Sentiment Classification Using Importance of Sentences
by: Gihyeon Choi, et al.
Published: (2020-11-01)

Video Caption Based Searching Using End-to-End Dense Captioning and Sentence Embeddings
by: Akshay Aggarwal, et al.
Published: (2020-06-01)

Design and Investigation of Capsule Networks for Sentence Classification
by: Haftu Wedajo Fentaw, et al.
Published: (2019-05-01)

Non-fungible tokens
by: Idelberger, F., et al.
Published: (2022)

The Use of Response Tokens In Waiting for Godot by Samuel Beckett
by: Muhammad Izzul Islam, et al.
Published: (2016-09-01)

When the Teacher is the Token: Moving from Antiblackness to Antiracism
by: Manya C. Whitaker
Published: (2021-09-01)

An Adaptive Token Passing Algorithm Applicable to MS/TP Network
by: Ping Ren, et al.
Published: (2011-10-01)

Using sentence-level classification to predict sentiment at the document-level
by: Hutton, Amanda Rachel
Published: (2012)

Stateless Re-Association in WPA3 Using Paired Token
by: Byoungcheon Lee
Published: (2021-01-01)

The Use of Economy Token to Reduce Tantrum Among Autistic Students
by: Mohamad Kassim Mohamed Yaseen, et al.
Published: (2018-07-01)

Hamiltonicity of Token Graphs of Some Join Graphs
by: Luis Enrique Adame, et al.
Published: (2021-06-01)

Evaluating Philippine Students’ Class Participation with a Token Currency System
by: Stevenson Q. Yu
Published: (2017-12-01)

PENERAPAN TEKNIK TOKEN ECONOMY UNTUK MENINGKATKAN KEMANDIRIAN ANAK TK KARTIKA IV-21 MADIUN
by: Muh. Chotim, et al.
Published: (2016-11-01)

Between Love and Hate: The New Korean Wave, Japanese Female Fans, and Anti-Korean Sentiment in Japan
by: Ji-Hyun Ahn, et al.
Published: (2020-12-01)

Predictive Sentencing : Normative and Empirical Perspectives
Published: (2019)

On the Connectivity of Token Graphs of Trees
by: Fabila-Monroy, R., et al.
Published: (2022)

A Hybrid Approach to Cross-Linguistic Tokenization: Morphology with Statistics
by: Kearsley, Logan R.
Published: (2016)

The Improvement of The Discipline for Early Childhood Through Token Economy Technique
by: Elizabeth Prima, et al.
Published: (2018-12-01)

AN ENHANCED DYNAMIC TOKEN PROTOCOL FOR UNDERWATER ACOUSTIC SENSOR NETWORKS
by: Guangyu Fan, et al.
Published: (2012-12-01)

Blockchain and the Tokenization of the Individual: Societal Implications
by: Monique J. Morrow, et al.
Published: (2019-10-01)

MQTT-Auth: a Token-based Solution to Endow MQTT with Authentication and Authorization Capabilities
by: Marco Calabretta, et al.
Published: (2018-12-01)

A new short version of the “Token Test” for Yawi native speakers
by: Phakkharawat Sittiprapaporn
Published: (2019-08-01)

RAPID ACQUISITION OF REINFORCEMENT SENSITIVITY UNDER CONCURRENT TOKEN-PRODUCTION SCHEDULES
by: Smith, Travis Ray
Published: (2012)

Pengaruh Penggunaan Token Ekonomi dalam Menurunkan Perilaku Disruptif Anak
by: Indri Graecela Amalo, et al.
Published: (2020-07-01)

Effect Size and Moderators of Effects for Token Economy Interventions
by: Soares, Denise
Published: (2012)

Pengaruh Model Pembelajaran Kooperatif Tipe Time Token Terhadap Hasil Belajar Siswa SMP
by: Rosalina Sisilia Santriana Son
Published: (2019-09-01)

Effects of Fixed- and Variable-Ratio Token Exchange Schedules on Performance with Children with Autism
by: Greaves, Stephanie A.
Published: (2008)

Subword Complexes and Nil-Hecke Moves
by: M. A. Gorsky
Published: (2013-12-01)

Penerapan Model Pembelajaran Time Token untuk Meningkatkan Keterampilan Berbicara Siswa
by: Asnita Asnita, et al.
Published: (2020-05-01)

Token Economy (Hadiah) untuk Penyelesaian Tugas dalam Layanan Penguasaan Konten
by: Yeni Satroma Dewi, et al.
Published: (2015-10-01)

SwapCT: Swap Confidential Transactions for Privacy-Preserving Multi-Token Exchanges
by: Engelmann Felix, et al.
Published: (2021-10-01)