To view PDF files

You need Adobe Reader 7.0 or later in order to read PDF files on this site.
If Adobe Reader is not installed on your computer, click the button below and go to the download site.

Information

Vol. 20, No. 11, pp. 85–90, Nov. 2022. https://doi.org/10.53829/ntr202211in1

Event Report: NTT Communication Science Laboratories Open House 2022

Takaaki Tanaka, Sadao Hiroya, Xiaomeng Wu,
Masakazu Ishihata, Hiromi Narimatsu, Yuya Chiba,
Naohiro Tawara, Toyomi Meguro, and Tomoki Ookuni

Abstract

NTT Communication Science Laboratories Open House 2022 was held online, the content of which was published on the Open House 2022 web page at noon on June 2nd, 2022. The videos of 6 talks and 29 exhibits presented our latest research efforts in information and human sciences and were viewed more than 5000 times in the first month after the event.

Keywords: information science, human science, artificial intelligence

PDF PDF

1. Overview

NTT Communication Science Laboratories (CS Labs) aims to establish cutting-edge technologies that enable heart-to-heart communication between people and between people and computers. We are thus working on a fundamental theory that approaches the essence of human beings and information science, as well as on innovative technologies that will transform society. CS Labs’ Open House is held annually to introduce the results of our basic research and innovative leading-edge research with many hands-on intuitive exhibits to those who are engaged in research, development, business, and education.

Open House 2022 was held online, as done last year, considering the situation against the spread of COVID-19. The latest research results were published with recorded lecture videos on the Open House 2022 web page at noon on June 2nd [1]. The content attracted many views in a month from NTT Group employees as well as from businesses, universities, and research institutions. The event content is still available.

In the afternoon of the day before the online event, a pre-event was held at an NTT WEST’s open innovation facility QUINTBRIDGE in Osaka as the first on-site exhibition in three years. Visitors showed great interest in the eight exhibits, and the significance of the hands-on exhibits were reaffirmed.

This article summarizes the online event’s research talks and exhibits.

2. Keynote speech

Dr. Futoshi Naya, vice president and head of CS Labs, presented a speech entitled “Communication science that adapts to the changing present and creates a sustainable future – Aiming to create technology that brings harmony and symbiosis with people, society, and the environment –,” in which he looked back upon the history of NTT and establishment of CS Labs then introduced present and future cutting-edge basic research and technologies (Photo 1).


Photo 1. Keynote speech (Dr. Futoshi Naya). “Commu­nication science that adapts to the changing present and creates a sustainable future
– Aiming to create technology that brings harmo­ny and symbiosis with people, society, and the environment –.”

Over the past 30 years since its founding on July 4, 1991, CS Labs has been promoting world-class basic research with the mission of constructing fundamental theories on the essence of human beings and information science and creating innovative technologies that will bring about communication that “reaches the heart.”

After introducing the latest research results of CS Labs, Dr. Naya declared that CS Labs will promote synergies between different research areas to create new values other than research within each specific area, and explore new frontiers through interdisciplinary collaboration with various fields such as physics, basic mathematics, medicine, biology, social sciences, and philosophy. He also stated that CS Labs will engage in basic research in communication science to bring about natural harmony and symbiosis among people, society, and the environment.

3. Research talks

The following four research talks highlighted recent significant research results and high-profile research themes. Each talk introduced some of the latest research results and provided background and an overview of the research. After each talk, we received many questions from participants in real time during the question-and-answer (Q&A) session, and we could see that the viewers had a high interest in our researches.

3.1 “AI hears your voice as if it were ‘right next to you’ – Audio processing framework for separating distant sounds with close microphone quality –,” Dr. Tomohiro Nakatani, Media Information Laboratory

Dr. Tomohiro Nakatani introduced advanced speech-enhancement techniques for extracting high-quality speech signals from degraded recordings with microphones distant from speakers, as if close microphones recorded them. When distant microphones capture a speech signal, they also record reverberation, background noise, and voices from extraneous speakers, degrading the quality of the captured speech signal. The techniques include a unified model for executing joint processing of dereverberation, denoising, and source separation, as well as a switching mechanism for enabling high-quality processing with a small number of microphones. He also discussed integration with deep-learning-based speech enhancement, e.g., SpeakerBeam (Photo 2).


Photo 2. Research talk (Dr. Tomohiro Nakatani). “AI hears your voice as if it were ‘right next to you’ – Audio processing framework for separating distant sounds with close microphone quality –.”

3.2 “Making digital twin of your heart – Personal heart modeling with mobile sensing and its potential application –,” Dr. Kunio Kashino, Media Information Laboratory/Bio-Medical Informatics Research Center

Dr. Kunio Kashino introduced the basic research he and co-researchers are conducting to actualize a future of healthcare using digital technology. Expectations for healthcare in modern life using digital technology are increasing every day. To support a more comprehensive approach, they are attempting to digitally model and replicate certain states and functions of the human body. In this talk, he introduced their initial trials focusing on the heart’s electrical and mechanical activities. He also discussed possible applications for their approach, including early detection of diseases and rehabilitation after treatment (Photo 3).


Photo 3. Research talk (Dr. Kunio Kashino). “Making digital twin of your heart – Personal heart modeling with mobile sensing and its potential application –.”

3.3 “Smart traffic coordination via digital twins – Future possibilities of distributed deep learning –,” Dr. Kenta Niwa, Innovative Communication Laboratory

Dr. Kenta Niwa introduced his latest research on collective intelligence formation and distributed cooperative control via digital twins. For the next innovation in the machine learning field, his research aims to optimally control large-scale systems, e.g., Internet of Things devices in the overall city, by learning collective intelligence from accumulated data on distributed devices. In this talk, Dr. Niwa introduced several research projects associated with smart traffic coordination via learnable digital twins (Photo 4).


Photo 4. Research talk (Dr. Kenta Niwa). “Smart traffic coordination via digital twins – Future possibilities of distributed deep learning –.”

3.4 “Where does mathematics come from? How does it proceed? Where it goes? – Challenges to unsolved problems in number theory and quantum computation by symmetry-based analysis and geometry –,” Dr. Masato Wakayama, Institute for Fundamental Mathematics

Dr. Masato Wakayama introduced arithmetic geometry, automorphic representations, and graph theory and representation/number theory, which are the mathematicians’ specialties of the Institute for Fundamental Mathematics, which was inaugurated in October 2021. In this talk, with several important historical examples in mathematics, he spoke on what kind of mathematics we have studied, how mathematics is progressing, and where mathematical research is heading in the future. He introduced the research connecting quantum interaction and number theory, and mentioned its position in the stream of modern mathematics (Photo 5).


Photo 5. Research talk (Dr. Masato Wakayama). “Where does mathematics come from? How does it proceed? Where it goes? – Challenges to unsolved problems in number theory and quantum computation by symmetry-based analysis and geometry –.”

4. Research exhibition

The Open House featured 29 exhibits displaying CS Labs’ latest research results. We categorized them into four areas: Science of Machine Learning, Science of Communication and Computation, Science of Media Information, and Science of Humans. Each exhibit prepared videos explaining the latest results and published them on the event web page (Photo 6). Several provided online demonstrations or demo videos to make them similar to direct demonstrations. We also introduced four Q&A sessions for each of the four areas and received many questions from participants in real time during the sessions. As in the previous year, we also had a Q&A system in which page visitors could freely post questions and comments and CS Labs’ researcher answer them. There were more than 50 public questions, and some made showed the enthusiasm of the visitors, such as long question comments and professional questions.


Photo 6. Exhibition web page.

The following list, taken from the Open House website, summarizes the research exhibits in each category.

4.1 Science of Machine Learning

  • Distributed traffic coordination without traffic signals
    – Learning of collective intelligence via digital twins –
  • Efficient training of photonic AI
    – Accelerated learning of fine-layered optical neural networks –
  • Multiple AIs make better predictions
    – Bayesian ensemble learning for better generalization performance –
  • Training fast & lightweight neural networks
    – Pruning neural networks with iterative randomization –
  • How the events spread?
    – Learning time-evolving states via dynamic Hawkes processes –

4.2 Science of Communication and Computation

  • Rapid disaster recovery through efficient shelter management
    – Optimal shelter operation with sequential return of evacuees –
  • Translating with your favorite expressions
    – Lexically constrained neural machine translation –
  • Toward uncongested infrastructures under user-equality
    – Equilibrium optimization of combinatorial congestion games –
  • Looking for mistakes in machine translation
    – Post-editing support based on source-target word alignment –
  • Elderly-friendly speaking styles
    – Selecting voice and words for better understandability –
  • Talking with AIs about views from a vehicle
    – Casual-dialog system based on scenery and nearby information –
  • Toward secure cryptography against quantum attacks
    – Quantum algorithm for finding collisions of hash functions –
  • Where does the wonder of numbers come from?
    – Finding new arithmetic phenomena via generalized motives –

4.3 Science of Media Information

  • “Huh? What do you mean?” Summarize a long story short
    – Robust speech summarization against speech recognition errors –
  • Flexible bokeh renderer based on predicted depth
    – Deep generative model for learning depth and bokeh effects only from natural images –
  • Heart health monitoring with sounds and electric signals
    – Estimating heart activities from multichannel sounds and ECG signals –
  • Controlling facial expressions in face image from speech
    – Crossmodal action unit sequence estimation and image-to-image mapping –
  • Maintain comfortable visibility anytime, anywhere
    – Image blending with content-adaptive visibility predictor –

4.4 Science of Humans

  • Gazing and talking help infants learn
    – Elucidating effects of social cues on infants’ object learning –
  • Why do people hesitate to use contact tracing apps?
    – Social factors influencing adoption of COCOA –
  • Is the rising fastball a perceptual illusion?
    – Modifying pitched ball perception by VR –
  • Mental skills of esports experts revealed by brain measurement
    – The relationship between frontal neural oscillation and performance –
  • Unveiling the auditory system with a neural network
    – Approaches to cochlear implant and binaural processing –
  • How does mindfulness meditation reduce stress?
    – Autonomic and endocrinological variation by meditation style –
  • Measuring well-being through diverse aspects
    – Well-being in terms of mental states, values, and idea of self –
  • Faster walking by moving the wall forward
    – Vision-based speedometer regulates human walking –
  • Fingertip illusions direct the mind
    – How the brain decodes pulling sensations –
  • What do we want to touch?
    – Understanding of desire to touch using large-scale Twitter data –
  • Eyes as a window of our mind
    – Pupil size tracks subjective perceptual changes –

5. Special lecture

We asked Professor Kazuo Okanoya, professor of Advanced Comprehensive Research Organization, Teikyo University, to give a special lecture entitled “Animal mind and communicative behavior,” and conduct a panel discussion with CS Labs’ researchers Makio Kashino, Tessei Kobayashi, and Sadao Hiroya.

In the lecture, he talked about his research on bio-psychology of communication to speculate on the mental experiences of animals and elucidate the evolution and neural mechanisms of such mental experiences by measuring neural activities that correlate with behavior. He explained the need for a method of studying the “animal mind” as well as the science of the “human mind,” which is based on anthropomorphism, although the study of the “animal mind” has been discouraged as anthropomorphism. He explained that in communicative behavior, sending a certain signal changes the behavior of the receiver, which in turn allows the sender to gain some benefit, and he introduced videos of experiments with examples of animal behavior for such communication. He concluded by stating that the following are necessary for the mind of artificial intelligence: “communication,” similar to what animals do; “theory of mind,” a system for predicting the behavior of others; and “mirror neurons,” a system for converting the behavior of others into its own behavior.

6. Concluding remarks

Open House 2022 was held as an event to present our latest results on a website. The lecture videos were viewed more than 4000 times in June by various groups of users. Participants provided many valuable opinions that will encourage us in further research activities through interactive means of communication with CS Labs’ researchers such as the Q&A system and Q&A session at the exhibition and lectures. In closing, we would like to offer our sincere thanks to all the participants of this online event.

Reference

[1] Website of NTT Communication Science Laboratories Open House 2022,
http://www.kecl.ntt.co.jp/openhouse/2022/index_en.html
Takaaki Tanaka
Senior Research Scientist, Linguistic Intelligence Research Group, Innovative Communication Laboratory, NTT Communication Science Laboratories.
He received a B.S., M.S., and Ph.D. from Osaka University. He joined NTT laboratories in 1996 and has been engaged in research on natural language processing including machine translation and language analysis. His current research interests include natural language generation and crosslingual semantic textual similarity based on syntactic and semantic analysis. He is a member of Association for Computational Linguistics, Association for Natural Language Processing, and the Japanese Society of Artificial Intelligence (JSAI).
Sadao Hiroya
Senior Research Scientist, Sensory and Motor Research Group, Human Information Science Laboratory, NTT Communication Science Laboratories.
He received an M.E. and Ph.D. from the Tokyo Institute of Technology in 2001 and 2006. He joined NTT Communication Science Laboratories in 2001. From 2001 to 2003, he was also a researcher on the CREST project of the Japan Science and Technology Agency. He was a visiting scholar at Boston University from 2007 to 2008. He received the first Itakura Prize Innovative Young Researcher Award and the ASJ Activity Contribution Award by the Acoustical Society of Japan (ASJ). His current research interests include the links between production and perception of speech, functional brain imaging, and musical performance. He is a member of the Institute of Electrical and Electronics Engineers (IEEE), the Institute of Electronics, Information and Communication Engineers (IEICE), and ASJ.
Xiaomeng Wu
Senior Research Scientist, Recognition Research Group, Media Information Laboratory, NTT Communication Science Laboratories.
He received a B.S. from the University of Shanghai for Science and Technology, China, in 2001, and M.S. and Ph.D. from the University of Tokyo, Japan, in 2004 and 2007. He joined NTT in 2013 and has been engaged in research on image retrieval and recognition. His current research interests include computer vision, image processing, and multimedia. He is a member of IEEE and IEICE.
Masakazu Ishihata
Senior Research Scientist, Learning and Intelligent Systems Research Group, Innovative Communication Laboratory, NTT Communication Science Laboratories.
He received a B.E., M.E., and Ph.D. in engineering from Tokyo Institute of Technology in 2008, 2010, and 2013. He joined NTT laboratories in 2013 and has been engaged in research on artificial intelligence (AI) and machine learning (ML). His current research interests include discrete structure manipulation systems and their application to AI and ML. He received the JSAI Incentive Award in 2010 and the JSAI Best Paper Award in 2013.
Hiromi Narimatsu
Senior Research Scientist, Human Information Science Laboratory and Innovative Communication Laboratory, NTT Communication Science Laboratories.
She received an M.E. and Ph.D. in engineering from the University of Electro-Communications, Tokyo, in 2011 and 2017. She joined NTT in 2011. Her research interests include natural language processing, spoken dialogue systems, and mathematical modeling. She is a member of IEEE, IEICE, Information Processing Society of Japan (IPSJ) and JSAI.
Yuya Chiba
Research Scientist, Interaction Research Group, Innovative Communication Laboratory, NTT Communication Science Laboratories.
He received a B.E., M.E., and Ph.D. in engineering from Tohoku University, Miyagi, in 2010, 2012, and 2015. He was an assistant professor at the Graduate School of Engineering, Tohoku University, from 2016 to 2020. He is currently a research scientist at NTT Communication Science Laboratories. His research interests include spoken dialogue systems, multimodal dialogue systems, and human-centric interfaces.
Naohiro Tawara
Research Scientist, Media Information Laboratory, NTT Communication Science Laboratories.
He received a Ph.D. in engineering from Waseda University, Tokyo, in 2016. After serving as a lecturer at Waseda University, he joined NTT Communication Science Laboratories in 2019. He has been engaged in research on speech processing including speaker recognition, age estima­tion, and language modeling. His current research interests include profiling human attributes
such as age, height, and emotion from voice. He is a member of IEEE, IEICE, IPSJ, and ASJ. He received the Awaya Prize Young Researcher Award from ASJ in 2018 and Yamashita
SIG Research Award from IPSJ in 2019.
Toyomi Meguro
Research Scientist, Interaction Research Group, Innovative Communication Laboratory and Research and Planning Section, NTT Communication Science Laboratories.
She received an M.E. in engineering from Tohoku University, Miyagi, in 2008 and joined NTT Communication Science Laboratories the same year. Her research interests include spoken dialogue systems and social skill training. She received the COLING 2010 Best paper finalist from the COLING in 2010 and the JSAI Annual Conference Award from JSAI in 2014.
Tomoki Ookuni
Senior Research Scientist, Research and Planning Section, NTT Communication Science Laboratories.
He received a B.A. in economics from Kyoto University in 1990 and M.A. in marketing science from Osaka Prefecture University in 2004. He joined NTT in 1990 and has been engaged in managing research and development at both NTT WEST and NTT since 1998. He joined NTT Communication Science Laboratories in 2019.

↑ TOP