Ryuto Koike

Email: ryuto.koike [at] nlp.c.titech.ac.jp

Google Scholar | GitHub | Twitter | LinkedIn | CV
About me

Hello! I am a final-year PhD student (est. March 2026) at the Institute of Science Tokyo, advised by Prof. Naoaki Okazaki. My research aims to develop principles and methods for building safe, secure, and reliable AI systems and for mitigating their negative societal implications. My current work focuses on membership inference attacks, AI-generated text detection, jailbreak defense, and reliable LLM-as-a-judge. My work has been published in top-tier conferences such as ACL, EMNLP, and AAAI, and I have led multiple collaborative projects with Prof. Chris Callison-Burch and Prof. Preslav Nakov. In particular, my research on detecting AI-generated text has garnered notable attention: it has been covered in the Nikkei, received conference awards, and collectively has over 100 citations. Outside of academia, I serve as a research advisor for a Japanese startup working on multilingual text generation.

I am on the job market for 2026! Please reach out if you think my background and experience may be a good fit for your organization.

News
Selected Works (*: equal contribution. †: undergraduate/master's mentee.) [ Google Scholar ]

Machine Text Detectors are Membership Inference Attacks
Ryuto Koike*, Liam Dugan*, Masahiro Kaneko, Chris Callison-Burch, Naoaki Okazaki
Preprint 2025
TL;DR - We theoretically prove that membership inference attacks (MIA) and machine-generated text detection share the same optimal metric, and empirically demonstrate strong cross-task transferability (ρ > 0.6) across diverse domains and generators. Notably, a machine text detector outperforms a state-of-the-art MIA on MIA benchmarks. To support cross-task development and fair evaluation, we introduce MINT, a unified evaluation suite implementing 15 recent methods from both tasks.

ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability
Ryuto Koike, Masahiro Kaneko, Ayana Niwa, Preslav Nakov, Naoaki Okazaki
Preprint 2025
TL;DR - We propose ExaGPT, an interpretable AI text detector that classifies a text by checking whether it shares more similar spans with human-written or machine-generated texts in a datastore, and presents those spans as evidence so users can assess how reliable the decision is. ExaGPT achieves both high interpretability and strong detection performance, outperforming prior interpretable detectors by up to +37.0% accuracy at a 1% false positive rate.

OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples
Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki
AAAI 2024
TL;DR - We propose OUTFOX, a framework that improves the robustness of AI text detectors by allowing both the detector and the attacker to adversarially learn from each other's output through in-context learning, achieving a +41.3% F1 improvement against strong adaptive attacks. This paper is among the first to effectively use AI to detect AI.
πŸ† Double Sponsorship Awards (1/140 β‰ˆ 0.7%) in YANS
πŸ“Έ Featured in Nikkei, NAACL Tutorial, Originality.ai Blog
πŸ“ˆ 130 citations in Google Scholar

How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection
Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki
EMNLP Findings 2024
TL;DR - We reveal the vulnerabilities of AI text detectors to prompt diversity in text generation. Specifically, even task-oriented constraints -- constraints that would naturally be included in an instruction and are unrelated to detection evasion -- degrade the detection performance of existing powerful detectors. We highlight the importance of ensuring prompt diversity to build robust benchmarks grounded in real-world scenarios.

Likelihood-based Mitigation of Evaluation Bias in Large Language Models
Masanari Ohi†, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki
ACL Findings 2024
TL;DR - We identify a likelihood bias in LLM-as-a-judge, i.e., LLMs overrate texts with higher likelihoods while underrating those with lower likelihoods. We further propose a simple yet effective mitigation method via in-context learning, achieving better alignment with human evaluations.
πŸ† Outstanding Young Researcher’s Paper (18/427 β‰ˆ 4.2%) in ANLP
Experiences
Institute of Science Tokyo, Tokyo, Japan
Doctoral Researcher (2023.04 - Present)
Advisor: Prof. Naoaki Okazaki
University of Pennsylvania, Philadelphia, PA, USA
Visiting Researcher (2024.10 - 2025.10)
Advisor: Prof. Chris Callison-Burch
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), Abu Dhabi, UAE
Research Collaborator (2024.04 - 2025.04)
Advisor: Prof. Preslav Nakov
Exawizards, Inc., Tokyo, Japan
Machine Learning Engineer Intern (2022.02 - 2022.03)
CyberAgent, Inc., Tokyo, Japan
Research Intern (2021.09 - 2022.01)
Software Engineer Intern (2021.07 - 2021.08)
Education
Institute of Science Tokyo (formerly Tokyo Institute of Technology), Tokyo, Japan
Ph.D. in Computer Science (2023.04 - est. 2026.04)
Keio University, Tokyo, Japan
M.S. in Information and Computer Science (2021.04 - 2023.03)
B.S. in Information and Computer Science (2017.04 - 2021.03)
Grants
Honors
  • Sponsorship Award from CyberAgent, Inc. (Top 2/246=0.8%)
    The 20th Symposium of Young Researcher Association for NLP Studies (YANS 2025). Synthesizing Instruction-Tuning Datasets with Contrastive Decoding.
  • Sponsorship Award from Polaris.AI (Top 1/246=0.4%)
    The 20th Symposium of Young Researcher Association for NLP Studies (YANS 2025). Proposal of Defense Against Multi-Turn Jailbreak Attacks.
  • Encouragement Award (Top 23/187=12.3%)
    The 19th Symposium of Young Researcher Association for NLP Studies (YANS 2024). Easily Detectable LLMs Without Sacrificing Its Generative Capability.
  • Sponsorship Award from CyberAgent, Inc. (Top 2/187=1.1%)
    The 19th Symposium of Young Researcher Association for NLP Studies (YANS 2024). Easily Detectable LLMs Without Sacrificing Its Generative Capability.
  • Sponsorship Award from PKSHA Technology (Top 1/140=0.7%)
    The 18th Symposium of Young Researcher Association for NLP Studies (YANS 2023). OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples.
  • Sponsorship Award from HAKUHODO Technologies (Top 1/140=0.7%)
    The 18th Symposium of Young Researcher Association for NLP Studies (YANS 2023). OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples.
  • Silver Medal (Top 165/4373=3.8%)
    Kaggle, Mechanisms of Action (MoA) Prediction, 2020.
Academic Service

©︎ Ryuto Koike / Design: Jon Barron.