When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Rei Higuchi, Ryotaro Kawata, Naoki Nishikawa, Kazusato Oko, Shoichiro Yamaguchi, Sosuke Kobayashi, Seiya Tokui, Kohei Hayashi, Daisuke Okanohara, Taiji Suzuki
In Proceedings of Conference on Language Modeling (COLM) 2025, Oct. 2025. [arxiv]
Efficient Construction of Model Family through Progressive Training Using Model Expansion
Kazuki Yano, Sho Takase, Sosuke Kobayashi, Shun Kiyono, Jun Suzuki
In Proceedings of Conference on Language Modeling (COLM) 2025, Oct. 2025. [arxiv]
User-Guided Correction of Reconstruction Errors in Structure-from-Motion
Sotaro Kanazawa, Jinyao Zhou, Yuta Kikuchi, Sosuke Kobayashi, Chunyu Li, Fabrice Matulic, Takeo Igarashi, Keita Higuchi
ACM Conference on Intelligent User Interfaces (IUI), Mar. 2025. [paper]
Spike No More: Stabilizing the Pre-training of Large Language Models
Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki
In Proceedings of Conference on Language Modeling (COLM) 2025, Oct. 2025. (Preprint 2023.) [arxiv]
B2T Connection: Serving Stability and Performance in Deep Transformers
Sho Takase, Shun Kiyono, Sosuke Kobayashi, Jun Suzuki
In Proceedings of Findings of ACL 2023, July 2023. [paper] [arxiv] [code]
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi, Eiichi Matsumoto, Vincent Sitzmann
In Proceedings of NeurIPS 2022, Nov. 2022. [paper] [arxiv] [code] [project]
Diverse Lottery Tickets Boost Ensemble from a Single Pretrained Model
Sosuke Kobayashi, Shun Kiyono, Jun Suzuki, Kentaro Inui
In Proceedings of Workshop on Challenges & Perspectives in Creating Large Language Models (BigScience), May 2022. [paper] [arxiv] [bib]
VocabEncounter: NMT-powered Vocabulary Learning by Presenting Computer-Generated Usages of Foreign Words into Users' Daily Lives
Riku Arakawa, Hiromu Yakura, Sosuke Kobayashi
In Proceedings of CHI 2022, Apr. 2022. [paper] [bib]
Instance-Based Neural Dependency Parsing
Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Masashi Yoshikawa, Kentaro Inui
TACL (Transactions of the Association for Computational Linguistics), Vol. 9, Dec. 2021. [paper] [arxiv] [bib]
SHAPE: Shifted Absolute Position Embedding for Transformers
Shun Kiyono, Sosuke Kobayashi, Jun Suzuki, Kentaro Inui
In Proceedings of EMNLP 2021, Nov. 2021. [paper] [arxiv] [bib]
*Efficient Estimation of Influence of a Training Instance
Sosuke Kobayashi, Sho Yokoi, Jun Suzuki, Kentaro Inui
Journal of Natural Language Processing (*written in Japanese, extended version of the paper at SustaiNLP 2020), Vol. 28-2, June 2021. (論文賞; Best Paper Award) [paper] [arxiv] [code] [bib]
Efficient Estimation of Influence of a Training Instance
Sosuke Kobayashi, Sho Yokoi, Jun Suzuki, Kentaro Inui
In Proceedings of First Workshop on Simple and Efficient Natural Language Processing (SustaiNLP 2020), Nov. 2020. [paper] [arxiv] [code] [bib]
All Word Embeddings from One Embedding
Sho Takase, Sosuke Kobayashi
In Proceedings of NeurIPS 2020, Dec. 2020. [paper] [arxiv] [code] [bib]
Instance-based Learning of Span Representations: A
Case Study through Named Entity Recognition
Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho
Yokoi, Tatsuki Kuribayashi, Ryuto Konno, Kentaro Inui
In Proceedings of ACL 2020, July 2020.
[paper] [arxiv] [bib]
Train Sparsely, Generate Densely: Memory-Efficient Unsupervised Training of High-Resolution Temporal GAN
Masaki Saito, Shunta Saito, Masanori Koyama, Sosuke Kobayashi
International Journal of Computer Vision, May 2020.
[paper] [arxiv] [code]
Learning from Observation-Only Demonstration for
Task-Oriented Language Grounding via
Self-Examination
Tsu-Jui Fu, Yuta Tsuboi, Sosuke Kobayashi, Yuta
Kikuchi
In Proceedings of 3rd Workshop on Visually Grounded
Interaction and Language (ViGIL), Dec. 2019. [paper]
Data Interpolating Prediction: Alternative
Interpretation of Mixup
Takuya Shimada, Shoichiro Yamaguchi, Kohei Hayashi,
Sosuke Kobayashi
In Proceedings of 2nd Workshop on Learning with
Limited Labeled Data: Weak Supervision and Beyond (LLD),
May 2019. [paper]
[arxiv]
DQN-TAMER: Human-in-the-Loop Reinforcement Learning
with Intractable Feedback
Riku Arakawa, Sosuke Kobayashi, Yuya Unno, Yuta
Tsuboi, Shin-ichi Maeda
In Proceedings of 2nd Workshop on Human-Robot Teaming
Beyond Human Operational Speeds and Robot Teammates
Operating in Dynamic, Unstructured Environments
(RT-DUNE), May 2019. [arxiv]
Pointwise HSIC: A Linear-Time Kernelized
Co-occurrence Norm for Sparse Linguistic Expressions
Sho Yokoi, Sosuke Kobayashi, Kenji Fukumizu,
Jun Suzuki, Kentaro Inui
In Proceedings of EMNLP 2018, Oct. 2018. [paper]
[arxiv]
[slide]
[bib]
Unsupervised Learning of Style-sensitive Word
Vectors
Reina Akama, Kento Watanabe, Sho Yokoi, Sosuke
Kobayashi, Kentaro Inui
In Proceedings of ACL 2018, July 2018. [paper]
[arxiv]
[code]
[bib]
[project]
Contextual Augmentation: Data Augmentation by Words
with Paradigmatic Relations
Sosuke Kobayashi
In Proceedings of NAACL HLT 2018, June 2018. [paper]
[arxiv]
[code]
[bib]
Interactively Picking Real-World Objects with
Unconstrained Spoken Language Instructions
Jun Hatori*, Yuta Kikuchi*, Sosuke Kobayashi*,
Kuniyuki Takahashi*, Yuta Tsuboi*, Yuya Unno*, Wilson Ko,
Jethro Tan (* equally contributed)
In Proceedings of International Conference on Robotics
and Automation (ICRA) 2018, June 2018. (Best Paper Award
on Human-Robot Interaction (HRI)) [paper]
[arxiv]
[dataset]
[bib]
[project]
[video]
A subsequent work of this is on an autonomous robot system demo
(WSJ,
BBC,
NVIDIA GTC 2019)
A Neural Language Model for Dynamically Representing
the Meanings of Unknown Words and Entities in a
Discourse
Sosuke Kobayashi, Naoaki Okazaki, Kentaro
Inui
In Proceedings of IJCNLP 2017, Nov. 2017. [paper]
[arxiv]
[slide]
[bib]
[video]
Generating Stylistically Consistent Dialog Responses
with Transfer Learning
Reina Akama, Kazuaki Inada, Naoya Inoue, Sosuke
Kobayashi, Kentaro Inui
In Proceedings of IJCNLP 2017, Nov. 2017. [paper]
[bib]
An RNN-based Binary Classifier for the Story Cloze
Test
Melissa Roemmele, Sosuke Kobayashi, Naoya
Inoue, Andrew Gordon
In Proceedings of 2nd Workshop on Linking Models of
Lexical, Sentential and Discourse-level Semantics
(LSDSem), Apr. 2017. [paper]
[bib]
Explaining Potential Risks in Traffic Scenes by
Combining Logical Inference and Physics Simulation
Ryo Takahashi, Naoya Inoue, Yasutaka Kuriya, Sosuke
Kobayashi, Kentaro Inui
International Journal of Machine
Learning and Computing (IJMLC), Oct. 2016. [paper]
Tohoku at SemEval-2016 Task 6: Feature-based Model
versus Convolutional Neural Network for Stance
Detection
Yuki Igarashi, Hiroya Komatsu, Sosuke
Kobayashi, Naoaki Okazaki, Kentaro Inui
In Proceedings of 10th International Workshop on
Semantic Evaluation (SemEval 2016), June 2016. [paper]
[bib]
Dynamic Entity Representation with Max-pooling
Improves Machine Reading
Sosuke Kobayashi, Ran Tian, Naoaki Okazaki,
Kentaro Inui
In Proceedings of NAACL HLT 2016, June 2016. [paper]
[code]
[bib]
Recognizing Potential Traffic Risks through
Logic-based Deep Scene Understanding
Naoya Inoue, Yasutaka Kuriya, Sosuke Kobayashi,
Kentaro Inui
In Proceedings of 22nd ITS World Congress, Oct. 2015.
(Best of the Rest) [paper]