Information

Vol. 18, No. 11, pp. 59–64, Nov. 2020. https://doi.org/10.53829/ntr202011in1

Event Report: NTT Communication Science Laboratories Open House 2020

Koji Kamei, Naotoshi Abekawa, Tomoki Ookuni,
Jun Muramatsu, Hiroshi Ueda, and Masataka Sawayama

Abstract

NTT Communication Science Laboratories Open House 2020 was held online, the content of which was published on the Open House 2020 web page at noon on June 4th, 2020. Within the first two days, around 2000 visitors enjoyed 5 talks and 31 exhibits, which included our latest research efforts in information and human sciences.

Keywords: information science, human science, artificial intelligence

PDF

1. Overview

NTT Communication Science Laboratories (CS Labs) aims to establish cutting-edge technologies that enable heartfelt communication between people and people and between people and computers. We are thus working on a fundamental theory that approaches the essence of being human and information science, as well as on innovative technologies that will transform society. NTT CS Labs’ Open House is held annually with the aim of introducing the results of our basic research and innovative leading-edge research with many hands-on intuitive exhibits to those who are engaged in research, development, business, and education.

Open House 2020 was, however, held via our website considering the recent situation against the spread of the novel coronavirus. The latest research results were published with recorded lecture videos on the Open House 2020 web page at noon on June 4th when the event was originally planned to start [1]. The content attracted many views in a month not only from NTT Group employees but also from businesses, universities, and research institutions. The event content is still available.

This article summarizes the event’s research talks and exhibits.

2. Keynote speech

Dr. Takeshi Yamada, vice president and head of NTT CS Labs, presented a speech entitled “I want to know more about you – Getting closer to humans with AI and brain science –,” in which he looked back upon the history of NTT and establishment of NTT CS Labs then introduced present and future cutting-edge basic research and technologies (Photo 1).


“I want to know more about you – Getting closer to humans with AI and brain science –”
Photo 1. Keynote speech (Dr. Takeshi Yamada).

Dr. Yamada pointed out the mission of NTT CS Labs to promote heartfelt communication between people and people or people and computers, by placing particular importance on pursuing artificial intelligence (AI) technologies that “approach and exceed human abilities” and science research to “obtain a deep understanding of people” and “make heartfelt contact.” Regarding today’s situation in which physical contact and face-to-face communication are significantly restricted, the talk pointed out that it has become even more important to identify the essence of emotional contact in heartfelt communication. The talk introduced NTT CS Labs’ recent AI technologies from three perspectives of getting closer to humans with voice and acoustic processing, understanding humans with language, and heartfelt communication with empathy and happiness (shi-awase) then declared to continue pursuing the essence of communication and tackle new challenges boldly and persistently.

3. Research talks

The following three research talks highlighted recent significant research results and high-profile research themes. Each talk introduced some of the latest research results and provided some background and an overview of the research.

(1) “Communication with desired voice – Deep generative model opens the way to innovative speech transformation –,” by Dr. Kou Tanaka, Media Information Laboratory

Dr. Tanaka introduced many technologies that apply deep learning, which is a common concept in AI, to speech information processing and various speech-to-speech conversions with high quality. NTT CS Labs considers speech information as an important tool for communication as well as the following four points as particularly important requirements for its conversion technology: 1) high quality, 2) learning from a small amount of data, 3) convertible in real time, and 4) covering various feature conversions such as voice characteristics, prosody, and accent. The research and development of such speech-to-speech conversion technology targets real-world application such as assistance for those with vocal disabilities, conversion of speaking style including emotion, and conversion of pronunciation and accent in language learning (Photo 2).


Photo 2. Research talk (Dr. Kou Tanaka).

(2) “Towards understanding human skin sensations – How the brain integrates spatio-temporal information in touch –,” Dr. Scinob Kuroki, Human Information Science Laboratory

Dr. Kuroki explained information processing of human skin sensations. She introduced a psychophysical experiment in which a human is presented with a tactile stimulus and asked the timing and position of the stimulus. A mysterious phenomenon occurred in which the answers differed from the stimulus given. The brain causes illusions by interpreting physical input stimuli, but by carefully unraveling such phenomena and illusions, she explored the mechanism by which the brain processes tactile information. She also introduced advanced approaches such as designing and generating tactile stimuli using a three-dimensional (3D) printer based on a method inspired by human visual information processing systems and a new direction of haptic science (Photo 3).


Photo 3. Research talk (Dr. Scinob Kuroki).

(3) “Which word is more difficult for you, ‘car’ or ‘vehicle’? – Estimation of text readability and human vocabulary size –,” Dr. Sanae Fujita, Innovative Communication Laboratory

Dr. Fujita introduced studies of two estimation methods, text readability and human vocabulary-size estimation. The estimation of human vocabulary size is based on the word familiarity database that NTT CS Labs has been creating for over 20 years. The database has recently been updated with more than 160,000 words and the Reiwa version of the vocabulary-size estimation test was published and made available as a demonstration [2]. She also introduced efforts aimed at tailor-made education support such as recommending books suitable for individual learners by combining vocabulary-size estimation and text-difficulty estimation (Photo 4).


Photo 4. Research talk (Dr. Sanae Fujita).

4. Research exhibition

The Open House featured 31 exhibits displaying NTT CS Labs’ latest research results. We categorized them into four areas: Science of Machine Learning, Science of Communication and Computation, Science of Media Information, and Science of Humans. Each exhibit prepared presentation slides with recorded explanation and published on the event web page (Photo 5). Several provided online demonstrations or demo videos to make them closer to direct demonstrations. The following list, taken from the Open House website, summarizes the research exhibits in each category.


Photo 5. Exhibition web page.

4.1 Science of Machine Learning

  • People of the WWW, give us your computation!
    – Generating datasets using people and information of the WWW –
  • Presenting a quick solution to system failures
    – Generating recovery-command sequences by neural networks –
  • Refining spatially aggregated data from cities
    – Multivariate Gaussian processes for spatially aggregated data –
  • Fast inference of accurate anomaly detector
    – Transfer anomaly detection for unseen datasets –
  • Anomaly detection with low false-positive rate
    – Semi-supervised learning for maximizing partial AUC –
  • Is the data really biased?
    – Testing combinatorial correlation by decision diagrams –

4.2 Science of Communication and Computation

  • What happens if every player rushes selfishly
    – Equilibrium computation of congestion games –
  • Handle a huge quantum world through a tiny window
    – Investigation of the ability of indirect quantum controls –
  • Tuning machine translation with small tuning data
    – Domain adaptation with JParaCrawl, a large parallel corpus –
  • Assessing children’s emotional development
    – Investigating developmental changes via multiple cues –
  • Creating a personalized picture book
    – Support for parent-child picture book interaction –
  • How many words do you know?
    – Vocabulary-size test, Reiwa edition –
  • Kyomachi Seika will guide you!
    – Training the role-playing AI with community cooperation –
  • What does he/she think in this situation?
    – Sentiment text generation based on personality –

4.3 Science of Media Information

  • Can you guess the age from the voice?
    – Deep speaker attribute estimation with speaker clustering –
  • More wireless microphones are available in a room
    – BRAVE: bit-error-robust low-delay audio and voice encoding –
  • Pay attention to the speaker you want to listen to (II)
    – Neural selective hearing with audio-visual speaker clues –
  • Controlling voice expression using face expression
    – Crossmodal voice expression control –
  • Learning to search like humans
    – Adaptive spotting for efficient object search –
  • Deep learning without data aggregation from nodes
    – Asynchronous consensus algorithm for ML (machine learning) over various NWs (networks) –
  • Cardiac model that makes it heart
    – Gaussian process with physical laws for 3D cardiac modeling –
  • Listening carefully to your heart beat
    – Cardiohemodynamical analysis based on stethoscopic sounds –

4.4 Science of Humans

  • Make natural-looking illusions by perceptual model
    – Adaptive motion retargeting for illusion-based projection AR (augmented reality) –
  • Tiny eye movements reflect cognitive states
    – Relation of eye-movement dynamics with cognition and pupil –
  • Haptic metameric textures
    – Direct control of perceived texture of 3D printed stimuli –
  • What causes emotional change?
    – Monitoring emotion in experimental settings and daily life –
  • Special cognitive abilities of e-sports experts
    – Performance, physiological state, and brain activity –
  • Realizing harmony in rugby scrum
    – Easy assessment of player coordination with wearable sensors –
  • What is a “straight” ball?
    – Physical and perceptual attributes of a pitched ball –
  • Body representation for quick and skillful action
    – Uncertainty of hand-state estimate regulates stretch reflex –
  • Unconscious is smarter than conscious
    – Environmental dependency in visuomotor responses –

5. Special lecture

We asked Professor Noriko Osumi, vice president of Tohoku University, to give a special lecture entitled “A challenge to scientifically understand ‘individuality.’” It is well known that people with autistic spectrum disorder sometimes make considerable achievements in fields such as art and scientific research. Her research group has regarded atypical development as individuality, and investigated the genetic and non-genetic mechanisms underlying such atypicality using mice as model animals. In particular, she found that paternal aging affected the early development of vocal communication of pups and introduced cutting-edge research topics of molecular mechanisms with changes in epigenome information. “Science of individuality” must be required in a diversity society. The lecture finished with providing a grand perspective from “ideal form of society” to “human evolution.”

6. Concluding remarks

Unlike last year, Open House 2020 was not held as an event at a physical venue but instead to present our latest results on a website. The lecture videos were viewed more than 10,000 times in June from various segments of users. Though it was difficult to have lively discussion, the long-term growth of visitors encourages us in further research activities. In closing, we would like to offer our sincere thanks to all the participants of this online event.

References

[1] Website of NTT Communication Science Laboratories Open House 2020,
http://www.kecl.ntt.co.jp/openhouse/2020/index_en.html
[2] A Japanese Word Familiarity Database (in Japanese),
http://www.kecl.ntt.co.jp/icl/lirg/resources/goitokusei/
Koji Kamei
Senior Research Scientist. Learning and Intelligent Systems Research Group, Innovative Communication Laboratory, NTT Communication Science Laboratories.
He received a B.E. and M.E. in electronic engineering from Kyoto University in 1995 and 1997. He joined NTT in 1997 and has been engaged in research on cooperative systems and information sharing and recommendation. He was also engaged in research on cooperative robotic services at Advanced Telecommunications Research Institute International (ATR) from 2009 to 2019.
Naotoshi Abekawa
Distinguished Researcher, Human and Information Science Laboratory, NTT Communication Science Laboratories.
He received a B.E. from Tokyo Metropolitan University, an M.E. from Tokyo Institute of Technology, and a Ph.D. from Kyoto University, in 2003, 2005, and 2013. He joined NTT in 2005 and has been engaged in research on human information processing. His research interests include human sensorimotor mechanisms, especially visuomotor control properties. He is a member of the Society for Neuroscience, the Japan Neuroscience Society, the Japanese Neural Network Society, and the Institute of Electronics, Information and Communication Engineers (IEICE).
Tomoki Ookuni
Senior Research Scientist, Research and Planning Section, NTT Communication Science Laboratories.
He received a B.A. in economics from Kyoto University in 1990 and an M.A. in marketing science from Osaka Prefecture University in 2004. He joined NTT in 1990 and has been engaged in managing research and development at both NTT WEST and NTT since 1998. He joined NTT Communication Science Laboratories in 2019.
Jun Muramatsu
Research Scientist, Learning and Intelligent Systems Research Group, Innovative Communication Laboratory, NTT Communication Science Laboratories.
He received a B.S., M.S., and Ph.D. in mathematics from Nagoya University, Aichi, in 1990, 1992, and 1998. He joined NTT in 1992 and has been engaged in research on information theory. From Feb. 2007 to Feb. 2008, he was a visiting researcher at ETH, Zurich, Switzerland. From 2006 to 2010, he was an associate editor of IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. He received the Young Researcher Award of SITA (the Society of Information Theory and Its Application) in 2003 and the 63rd Best Paper Award of IEICE in 2007. He is a member of IEICE and the IEEE Information Theory Society
Hiroshi Ueda
Research Scientist, Kashino Diverse Brain Research Laboratory, NTT Communication Science Laboratories.
He received a B.S. from University of British Columbia in 2008, an M.S. from Tokyo Institute of Technology in 2011, and a Ph.D. from The University of Tokyo in 2014. He joined NTT Communication Science Laboratories in 2015. His current research interests include various topics related to human visuomotor control.
Masataka Sawayama
Research scientist, Human and Information Science Laboratory, NTT Communication Science Laboratories.
He received a Ph.D. in experimental psychology from Chiba University in 2013 and joined NTT Communication Science Laboratories the same year. He is an expert in psychophysical research on human visual processing, in particular color and material perception. He is a member of the Vision Sciences Society and the Vision Society of Japan.

↑ TOP