You need Adobe Reader 7.0 or later in order to read PDF files on this site.
If Adobe Reader is not installed on your computer, click the button below and go to the download site.
|
November 2015 Vol. 13 No. 11 |
|
Front-line Researchers
-
Kunio Kashino, Senior Distinguished Researcher, NTT Communication Science Laboratories
Overview As the volume of music, photographs, and video on the Internet continues to increase, the need for accurate and high-speed searching of media information is growing rapidly. We asked Dr. Kunio Kashino, Senior Distinguished Researcher at NTT Communication Science Laboratories, to tell us about the current state of research on media search in today¡Çs society and his thoughts on how researchers should view and approach their work.
Feature Articles: Communication Science as a Compass for the Future-
Embracing Information Science and Technology¡½Decoding, Exploring, and Designing the World
Abstract The era in which human beings are confronted with machines (computers or artificial intelligence) as disparate elements is coming to an end. From here on, we will embrace information science and technology as part of ourselves. This will necessitate the ability to decode, explore, and design the entire world, including us human beings. While bearing in mind the drastic changes in the information environment that we have experienced in the first fifteen years of the twenty-first century, we must think about what should make up the basic research that will form the compass of the future as we envision the year 2030, fifteen years from now.
-
Generative Modeling of Voice Fundamental Frequency Contours for Prosody Analysis, Synthesis, and Conversion
Abstract This article introduces a state-of-the-art technique that makes it possible to convert speech to different speaking styles through the manipulation of the fundamental frequency (F0) contour without destroying the naturalness of the speech. This technique can be used, for instance, to convert non-native speech to native-like speech, and to convert normal speech to speech with a more lively intonation similar to the way broadcasters speak. It can also be incorporated into text-to-speech systems to improve the naturalness of computer-generated speech.
-
Biological Measures that Reflect Auditory Perception
Abstract Brain processes involved in auditory perception are reflected in various physical/physiological responses. Our recent studies indicate that in addition to brainwaves, responses that might seem to have nothing to do with audition¡½for example, pupillary responses, sounds emitted from the ear, and rhythmic finger-tapping movements¡½actually provide information about subjective auditory experiences of listeners. These findings may lead to various applications such as designing auditory displays with pleasing sounds adapted to individual listeners and developing novel techniques for diagnosing or compensating for impaired hearing.
-
Deep Learning Based Distant-talking Speech Processing in Real-world Sound Environments
Abstract This article introduces advances in speech recognition and speech enhancement techniques with deep learning. Voice interfaces have recently become widespread. However, their performance degrades when they are used in real-world sound environments, for example, in noisy environments or when the speaker is some distance from the microphone. To achieve robust speech recognition in such situations, we must make progress in further developing various speech processing techniques. Deep learning based speech processing techniques are promising for expanding the usability of a voice interface in real and noisy daily environments.
-
Yu bi Yomu: A New Text Display System Using Tracing Behavior
Abstract NTT Communication Science Laboratories is researching a text display system called Yu bi Yomu, in which the appearance of text changes dynamically in response to the user¡Çs finger-tracing behavior. Research on digital text display has so far centered on discussions on how to achieve the feeling of using paper. However, digital text display has the potential to surpass attempts at simply imitating the paper medium by exploiting digital features in order to bring about major changes in the way that reading itself is performed. This article provides an overview of the Yu bi Yomu system and introduces the advantages of using this method.
-
Combinatorial Optimization Using Binary Decision Diagrams
Abstract Combinatorial optimization is being used to solve a wide range of real world tasks, but its application requires that we formulate the task as an optimization problem for which efficient methods for solving the problem exist. However, sometimes task-specific constraints prevent us from formulating the task as an easy-to-solve optimization problem. In this article, we present a new algorithm for solving combinatorial optimization problems by using a binary decision diagram (BDD), a data structure for representing a Boolean function as a compact graph. Our method can efficiently solve constraint-added variants of a class of optimization problems by representing the constraints with a BDD or zero-suppressed BDD (ZDD) and then applying an efficient dynamic programming algorithm.
Regular Articles-
Microscope Integrated with Optical Connector Cleaner for Cleaning and Inspecting Optical Fiber End-faces in a Single Operation
Abstract The end-faces of optical fibers must be kept clean because unclean end-faces can cause communication errors. When optical fibers are to be connected to each other, their end-faces are cleaned and inspected using two different devices in two separate time-consuming steps. First, a fiber cleaner is used to clean each end-face, and then a microscope is used to inspect each end-face. To simplify this process, we developed a device that integrates the cleaner and the microscope into a single tool, making it possible to perform both the cleaning and inspection in a single operation without having to change tools.
Global Standardization Activities-
Trends in Standardization Activities in China
Abstract The China Communications Standards Association (CCSA) is the sole organization in charge of standardization in the Chinese telecommunications industry. This article introduces recent developments in the CCSA¡Çs standardization activities and explains the structure of the Chinese standardization system and trends in the telecommunications industry.
Information-
Event Report: NTT Communication Science Laboratories Open House 2015
Abstract NTT Communication Science Laboratories Open House 2015 was held in Keihanna Science City, Kyoto, on June 4 and 5, 2015. Over 1200 visitors attended the event and enjoyed 5 talks and 30 exhibits focusing on our latest findings and activities in the fields of information and human sciences.
New NTT Colleagues
External Awards/Papers Published in Technical Journals and Conference Proceedings
|