Yuta Nakashima is an associate professor at the Institute for Datability Science, Osaka University. His research interests include computer vision, pattern recognition, natural language processing, and their applications.


  • Computer Vision
  • Pattern Recognition
  • Natural Language Processing


  • PhD in Engineering, 2012

    Osaka University

  • ME, 2008

    Osaka University

  • BE, 2006

    Osaka University



Associate Professor

Institute for Datability Science, Osaka University

Jan 2017 – Present Osaka, Japan

Visiting Scholar

Robotics Institute, Carnegie Mellon University

Apr 2015 – Mar 2016 Pennsylvania, US

Assistant Professor

Nara Institute of Science and Technology

Apr 2012 – Dec 2016 Nara, Japan

Visiting Scholar

University of North Carolina at Charlotte

Feb 2012 – Mar 2012 North Carolina, US

JSPS Research Fellow (PD)

Osaka University

Feb 2012 – Mar 2012 Osaka, Japan

JSPS Research Fellow (DC2)

Osaka University

Oct 2008 – Jan 2012 Osaka, Japan



Recognition as Excellent Research Work and Collaboration

Open Paper Award

Inferring what the videographer wanted to capture
Y. Nakashima and N. Yokoya

Recent Posts


Knowledge VQA

Visual question answering (VQA) with knowledge is a task in which answering questions about images or videos requires knowledge beyond what is visible. This additional requirement poses an interesting challenge on top of the classic VQA tasks.

Recent Publications

ContextNet: representation and exploration for painting classification and retrieval in context

In automatic art analysis, models that besides the visual elements of an artwork represent the relationships between the different …

Speech-driven face reenactment for a video sequence

We present a system for reenacting a person’s face driven by speech. Given a video sequence with the corresponding audio track of …

Human shape reconstruction with loose clothes from partially observed data by pose specific deformation

Reconstructing the entire body of moving human in a computer is important for various applications, such as tele-presence, virtual …

Legal information as a complex network: Improving topic modeling through homophily

Topic modeling is a key component to computational legal science. Network analysis is also very important to further understand the …

BUDA.ART: A multimodal content-based analysis and retrieval system for Buddha statues

We introduce BUDA.ART, a system designed to assist researchers in Art History, to explore and analyze an archive of pictures of Buddha …