Meta presents six speech recognition and understanding research papers

August 23, 2023

Meta, the parent company of Facebook, has presented six research papers at INTERSPEECH 2023, the conference of the International Speech Communication Association, in Dublin.

The papers focus on advances in speech recognition and understanding, including new methods for improving the accuracy of speech recognition, developing more robust spoken language understanding systems, and creating expressive speech synthesis models.

One of the papers, titled “Multi-head State Space Model for Speech Recognition,” introduces a new architecture that can improve the accuracy of speech recognition by capturing both local and global temporal patterns in speech data. The paper also presents a new model called the Stateformer, which achieves state-of-the-art results on the LibriSpeech speech recognition dataset.

Another paper, titled “Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding,” addresses the problem of inaccurate text representations in end-to-end spoken language understanding systems. The paper proposes a new method for training these systems that takes into account the confidence levels of automatic speech recognition (ASR) hypotheses.

The list also includes EXPRESSO, which introduces a new dataset of the same name for expressive speech synthesis covering 26 styles, discusses the associated challenges, and proposes a new training method. Another paper, “Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free & Hybrid Approaches,” compares alignment-based, alignment-free, and hybrid approaches to activating smart devices with specific keywords.

Furthermore, Meta presented the MuAViC benchmark for speech translation, comprising a multilingual audio-visual corpus and evaluation metrics, as well as ESPnet-SE++, a speech enhancement system designed to improve speech quality in noisy conditions.

The sources for this piece include an article in AnalyticsIndiaMag.


TND Newsdesk
