News!

2023.02.10: our paper has been presented at AAAI23 in Washington D.C., USA!

2022.10.25: our paper has been accpeted by AAAI23!

2022.03.01: our journal paper has been published in IEICE TRANSACTIONS on Information and Systems!

2020.07.01: our journal paper has been published in IJMDEM!

“Discover Excellence.” -The University of Tokyo Slogan

Resume

A self-motivated researcher with rich implementation experience in deep learning, including computer vision, multimodal processing, contrastive learning, and imbalanced learning.
Strong interest in media contents with subjective evaluation. Experienced in collaborative research with researchers from different fields. Also experienced in AI interface development.

Postdoctoral Fellowship

Department of Information and Communication Engineering, The University of Tokyo

04/2023 – Present

Ph.D. Degree in Information Science

Department of Information and Communication Engineering, The University of Tokyo

Thesis title: Presentation Skill Assessment Systems using Deep Neural Networks

04/2020 – 03/2023

Master’s Degree in Information Science

Department of Information and Communication Engineering, The University of Tokyo

Thesis title: Impression Analysis on Presentations using Attention-based LSTM

04/2018 – 03/2020

Japanese Language School

Fuji International Language Institute

07/2016 – 03/2018

Bachelor’s Degree in Engineering

Software Engineering, Sun Yat-sen University

Thesis title: Plant Recognition Based on GoogleNet

10/2012 – 07/2016

Research

Research experience in computer vision, multimodal processing, and natural language processing. Current research on contrastive learning and prompt learning.

Experiences


Experienced in collaborative research with companies and research institutes, and cooperation with other engineers.

Collaborative research with Rubato Co., Ltd.

Research Engineer

04/2019 - Present
Design assessment system of presentation slides using visual and structural features.
Published in journal: IEICE TRANSACTIONS on Information and Systems.
Collaborative research with PRAP Japan, Inc.

Research Engineer

10/2018 - 09/2019
Evaluation system of conference speaker using attention-based LSTM.
Published in journal: International Journal of Multimedia Data Engineering and Management.
Collaborative research with Keio University.

Research Engineer

11/2021 - 03/2022
Topic and user satisfaction analysis of a remote dialogue system in mental health intervention.
Collaborative research with Talent and Assessment Inc.

Research Engineer

02/2021 - Present
Evaluation system of online interview using multimodal Transformer.

Publications

(1) Journals

1. Shengzhou Yi, Junichiro Matsugami, and Toshihiko Yamasaki. Assessment System of Presentation Slide Design using Visual and Structural Features. IEICE TRANSACTIONS on Information and Systems, vol. E105-D, no. 3, pp. 587-596, 2022. [URL]

2. Shengzhou Yi, Koshiro Mochitomi, Isao Suzuki, Xueting Wang, and Toshihiko Yamasaki. Attention-based Multimodal Neural Network for Automatic Evaluation of Press Conferences. International Journal of Multimedia Data Engineering and Management, vol. 11, issue 3, pp. 1-19, 2020. [URL]

(2) International Conferences

[Oral, with review]

3. Shengzhou Yi, Koshiro Mochitomi, Isao Suzuki, Xueting Wang, and Toshihiko Yamasaki. Attention-based LSTM for Automatic Evaluation of Press Conferences. In IEEE Third International Conference on Multimedia Information Processing and Retrieval (MIPR 2020), pp. 187-192, Aug. 6-8, 2020, Shenzhen, China. [URL]

4. Shengzhou Yi, Xueting Wang, and Toshihiko Yamasaki. Emotion and Theme Recognition of Music Using Convolutional Neural Networks. In MediaEval Benchmark Workshop (MediaEval 2019), Oct. 27-29, 2019, Sophia Antipolis, France.

5. Shengzhou Yi, Xueting Wang, and Toshihiko Yamasaki. Impression Prediction of Oral Presentation using LSTM and Dot-product Attention Mechanism. In IEEE Fifth International Conference on Multimedia Big Data (BigMM 2019), pp. 242‒246, Sep. 11-13, 2019, Singapore. [URL]

[Demo, with review]

6. Shengzhou Yi, Junichiro Matsugami, Hiroshi Yumoto, and Toshihiko Yamasaki. An Online Presentation Slide Assessment System Using Visual and Semantic Segmentation Features. In Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), Demo. Feb. 7-14, 2023, Washington D.C., USA.

7. Shengzhou Yi, Hiroshi Yumoto, Xueting Wang, and Toshihiko Yamasaki. PresentationTrainer: Oral Presentation Support System for Impression-related Feedback. In Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020), Demo, pp. 13644-13645. Feb. 7-12, 2020, New York, USA. [URL]

[Poster, without review]

8. Shengzhou Yi, Xueting Wang, and Toshihiko Yamasaki. Impression Prediction of Oral Presentation using LSTM and Dot-product Attention Mechanism. Third International Workshop on Symbolic-Neural Learning (SNL-2019), P-16, Jul. 11-12, 2019, Tokyo, Japan.

(3) Domestic Conferences and Symposia

9. Chen Fu, Yuesong Liu, Naoto Tanji, Hiroyuki Seshime, Shengzhou Yi, Ling Xiao, and Toshihiko Yamasaki. Constrained Advertisement Layout Generation based on Graph Neural Networks. Meeting on Image Recognition and Understanding (MIRU), Aug. 6-9, 2024, Kumamoto.

10. Tomoya Sugihara, Shuntaro Masuda, Shengzhou Yi, and Toshihiko Yamasaki. Price Prediction of Handmade Items using Multimodal Data. Image Engineering Technical Group (IE), IEICE Technical Report, Jun. 6-7, 2024, Niigata.

11. Shengzhou Yi, Toshiaki Yamasaki, and Toshihiko Yamasaki. AEnhancing Online Structured Job Interviews: A Comprehensive Personality Assessment Using Multimodal Neural Networks. Annual Conference of the Japanese Society for Artificial Intelligence (JSAI), May 28-31, 2024, Hamamatsu.

12. Shengzhou Yi, Toshiaki Yamasaki, and Toshihiko Yamasaki. Online Structured Job Interview Assessment Using Multimodal Transformer and Prompt Learning. Media Experience Virtual Environment (MVE), vol. 123, no. 228, pp. 34-39, Oct. 26-27, 2023, Muroran.

13. Shengzhou Yi, Junichiro Matsugami, Takuya Yamamoto, Yukiyoshi Katsumizu, and Toshihiko Yamasaki. Online Presentation Skill Training Systems Using Multi-Modal Neural Network. Meeting on Image Recognition and Understanding (MIRU), Demo-10, Jul. 25-28, 2023, Hamamatsu.

14. Shengzhou Yi, Toshiaki Yamasaki, and Toshihiko Yamasaki. An Assessment System of Online Structured Job Interviews Supported by Multi-Modal Deep Learning. Annual Conference of the Japanese Society for Artificial Intelligence (JSAI), Jun. 6-9, 2023, Kumamoto.

15. Shengzhou Yi, Junichiro Matsugami, Takuya Yamamoto, Yukiyoshi Katsumizu, and Toshihiko Yamasaki. A Presentation Training System Based on Multi-modal Neural Networks. Annual Conference of the Japanese Society for Artificial Intelligence (JSAI), Jun. 6-9, 2023, Kumamoto.

16. Shengzhou Yi, Toshiaki Yamasaki, and Toshihiko Yamasaki. Assessment System of Remote Structured Interview using Bimodal Neural Network. Image Engineering Technical Group (IE), IEICE Technical Report, Feb. 21-22, 2023, Sapporo.

17. Shengzhou Yi, Junichiro Matsugami, and Toshihiko Yamasaki. Presentation Slide Assessment System using Visual and Semantic Segmentation Features. Media Experience Virtual Environment (MVE), vol. 122, no. 175, pp. 16-21, Sep. 8-9, 2022, Tokyo.

18. Shengzhou Yi, Junichiro Matsugami, and Toshihiko Yamasaki. Presentation Slide Design Evaluation using Two-Level Vision Transformer. Meeting on Image Recognition and Understanding (MIRU), IS2-88, Jul. 25-28, 2022, Himeji.

19. Shengzhou Yi, Toshiaki Kikuchi, and Toshihiko Yamasaki. User Satisfaction Prediction for Dialogue System in Mental Health Interventions. Image Engineering Technical Group (IE), IEICE Technical Report, vol. 121, no. 374, pp. 1-6, Feb. 21-22, 2022, Sapporo.

20. Shengzhou Yi, Junichiro Matsugami, and Toshihiko Yamasaki. Identifying Design Problems of Presentation Slides using a Bimodal Neural Network. Media Experience Virtual Environment (MVE), IEICE Technical Report, vol. 121, no. 179, pp. 21-26, Sep. 17-28, 2021, Tokyo.

21. Shengzhou Yi, Li Tao, Xueting Wang, and Toshihiko Yamasaki. Class-Balanced Contrastive Pre-Training for Improving Long-Tailed Recognition. Meeting on Image Recognition and Understanding (MIRU). Jul. 27-30, 2021, Nagoya.

22. Shengzhou Yi, Junichiro Matsugami, Xueting Wang, and Toshihiko Yamasaki. Slide Design Assessment Featuring Visual and Structural Analysis. 人工知能と知識処理研究会 (AI), IEICE Technical Report, vol. 120, no. 281, pp. 13-18, Dec. 10-11, 2020, Hamamatsu.

23. Shengzhou Yi, Takuya Yamamoto, Osamu Yamamoto, Yukiyoshi Katsumizu, Hiroshi Yumoto, Xueting Wang, Toshihiko Yamasaki. Make Your Presentation Better: Oral Presentation Support System using Linguistic and Acoustic Features. Image Engineering Technical Group (IE), IEICE Technical Report, vol. 119, no. 421, pp. 317-322, Feb. 27-28, 2020, Sapporo.

24. Shengzhou Yi, Xueting Wang, and Toshihiko Yamasaki. CNN-based Music Emotion and Theme Recognition Featuring Shallow Architecture. Media Experience Virtual Environment (MVE), IEICE Technical Report, vol. 119, no. 386, pp. 99-100, Jan. 23-24, 2020, Nara.

25. Shengzhou Yi, Koshiro Mochitomi, Isao Suzuki, Xueting Wang, Toshihiko Yamasaki. Automatic Evaluation of Press Conferences Using LSTM with Self-Attention Mechanism. Human Communication Group (HCG) Symposium, Dec. 11-13, 2019, Hiroshima.

26. Shengzhou Yi, Wang Xueting, and Yamasaki Toshihiko. Impression Prediction of Oral Presentation Using LSTM with Dot-product Attention Mechanism. Media Experience Virtual Environment (MVE), IEICE Technical Report, vol. 119, no. 75, pp. 1-6, Jun. 10-11, 2019, Tokyo.

27. Shengzhou Yi, Toshihiko Yamasaki, Izumi Masumura, Yoshinori Yasui, Takako Misaki, Nobuhiko Okabe. Prediction of the National Epidemiological Surveillance of Infectious Diseases Using LSTM. Image Media Processing Symposium (IMPS), P-1-11, Nov. 19-21, 2018, Gotemba.

Awards

IDR User Platform Excellence Award and Enterprise Award (IDRユーザフォーラム 企業賞、優秀賞), co-author [URL]

JSAI Annual Conference Award (人工知能学会 全国大会優秀賞) [URL]

MVE Award (メディアエクスペリエンス・バーチャル環境基礎研究会 MVE賞)[URL]

Fundings

Grant-in-Aid for Early-Career Scientists, Japan Society for the Promotion of Science (JSPS). (FY2024 - FY2026)

Activities

ACMMM 2024 Program Committee Member

ICME 2023/2024 Program Committee Member

AAAI 2021/2022 Program Committee Member

MM Asia 2022 Program Committee Member

Scientific Reports Reviewer

IEICE Transactions on Information and Systems Reviewer