speech recognition

New advances in speaker diarization

In a recent publication, “New Advances in Speaker Diarization,” presented virtually at Interspeech 2020, we describe our new state-of-the-art speaker diarization system that introduces several novel techniques.

Continue reading

IBM Research AI at ICASSP 2020

The 45th International Conference on Acoustics, Speech, and Signal Processing is taking place virtually from May 4-8. IBM Research AI is pleased to support the conference as a bronze patron and to share our latest research results, described in nine papers that will be presented at the conference.

Continue reading

A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

In a recently published paper in this year’s INTERSPEECH, we were able to achieve additional improvement on the efficiency of Asynchronous Decentralized Parallel Stochastic Gradient Descent, reducing the training time from 11.5 hours to 5.2 hours using 64 NVIDIA V100 GPUs.

Continue reading

IBM Research advances in end-to-end speech recognition at INTERSPEECH 2019

IBM scientists presented three papers at INTERSPEECH 2019 that address the shortcomings of End-to-end automatic approaches for speech recognition - an emerging paradigm in the field of neural network-based speech recognition that offers multiple benefits.

Continue reading

Help build the next generation of AI-driven dialog systems

IBM Research AI and the University of Michigan are organizing a public competition to inspire and evaluate novel approaches that will lead to the next generation of AI-driven dialog systems.

Continue reading

IBM Sets New Transcription Performance Milestone on Automatic Broadcast News Captioning

IBM sets new performance records for automatic captioning of broadcast news audio, with error rates of 6.5% and 5.9% on two broadcast news benchmarks.

Continue reading

High-Efficiency Distributed Learning for Speech Modeling

A distributed deep learning architecture for automatic speech recognition that shortens run time without compromising model accuracy.

Continue reading

IBM achieves new record in speech recognition

Depending on whom you ask, humans miss one to two words out of every 20 they hear. In a five-minute conversation, that could be as many 80 words. But, for most of us speech recognition isn’t a problem. Imagine, though, how difficult it is for a computer? Last year, IBM announced a major milestone in […]

Continue reading

Hear like a bat

Dynamic artificial bat ears enrich speech signals Bats use biosonar to navigate their night flights through jungles and forests. Their system of ultrasonic pulses can pinpoint sound more precisely than man-made technical sonar. To replicate these capabilities, Prof. Rolf Müller, an IBM Faculty Award winner, and his team at Virginia Tech designed artificial bat ears […]

Continue reading