Meta's new multimodal translator uses a single model to speak 100 languages

We're getting tantalizingly close to Babelfish territory.

·Former Senior Editor

22 August 2023 at 9:30 am·2-min read

Though it's not quite ready to usher in the Doolittle future we've all been waiting for, modern AI translation methods are proving more than sufficient in accurately transforming humanity's roughly 6,500 spoken and written communication systems between one another. The problem is that each of these models tends to only do one or two tasks really well — translate and convert text to speech, speech to text or between either of the two sets — so you end up having to smash a bunch of models on top of each other to create the generalized performance seen in the likes of Google Translate or Facebook's myriad language services.

That's a computationally intensive process, so Meta developed a single model that can do it all. SeamlessM4T is "a foundational multilingual and multitask model that seamlessly translates and transcribes across speech and text," Meta's blog from Tuesday reads. It can translate between any of nearly 100 languages for speech-to-text and text-to-text functions, speech-to-speech and text-to-speech supports those same languages as inputs and outputs them in any of 36 others tongues, including English.

In their blog post, Meta's research team notes that SeamlessM4T "significantly improve[s] performance for the low and mid-resource languages we support," while maintaining "strong performance on high-resource languages, such as English, Spanish, and German." Meta built SeamlessM4T from its existing PyTorch-based multitask UnitY model architecture, which already natively performs the various modal translations as well as automatic speech recognition. It utilizes the BERT 2.0 system for audio encoding, breaking down inputs into their component tokens for analysis, and a HiFi-GAN unit vocoder to generate spoken responses.

Meta has also curated a massive open-source speech-to-speech and speech-to-text parallel corpus, dubbed SeamlessAlign. The company mined "tens of billions of sentences" and "four million hours" of speech from publicly available repositories to "automatically align more than 443,000 hours of speech with texts, and create about 29,000 hours of speech-to-speech alignments," per the blog. When tested for robustness, SeamlessM4T reportedly outperformed its (current state-of-the-art) predecessor against background noises and speaker style variations by 37 percent and 48 percent, respectively.

As with most all of its previous machine translation efforts — whether that's Llama 2, Massively Multilingual Speech (MMS), Universal Speech Translator (UST), or the ambitious No Language Left Behind (NLLB) project — SeamlessM4T is being open-sourced. "we believe SeamlessM4T is an important breakthrough in the AI community’s quest toward creating universal multitask systems," the team wrote. "Keeping with our approach to open science, we are excited to share our model publicly to allow researchers and developers to build on this technology." If you're interested in working with SeamlessM4T for yourself, head over to GitHub to download the model, training data and documentation.

Hello!
Rod Stewart, 79, rocked by shock divorce news: report
In a surprising turn of events Rod Stewart has received some shocking divorce news. See details.
Cosmo
Katy Perry rocked an underboob-baring diamanté bikini for 4th of July
To celebrate the 4th of July pop icon Katy Perry rocked an American flag themed bikini and referenced her hit 'Firework'
Parade
Kanye West's Wife Bianca Censori Shows Off Her Assets in Translucent Leggings and Tube Top
The couple was captured in photos at a California science museum on July 4.
Cosmo
Jade Thirlwall frees the nipple in a see-through chainmail top
Jade Thirlwall appears on the cover of beat magazine wearing a see-through chainmail top to free the nipple. See the image shared on social media, here.
Cosmo
Dua Lipa's dress is see-through, slinky and held together with safety pins 🧷🧷
Dua Lipa just rocked a tiny, see-through dress that was held together by safety pins, referencing Elizabeth Hurley's most iconic look
The Independent
Dad pleaded for daughter’s ex-boyfriend not to be freed from jail. Days later, her mutilated body was found in a car
‘If they let him out, he was going to kill her,’ Lauren Johansen’s father said he warned a judge just days before his daughter’s death
Yahoo News Australia
Deadly find hidden in suburban backyard soil triggers $200,000 fine
From the outside the home looked ordinary, but in the backyard investigators made a worrying discovery. Find out what it was.
Yahoo Sport Australia
Thanasi Kokkinakis in awful scenes as Alexei Popyrin produces stunning upset at Wimbledon
Thanasi Kokkinakis was left absolutely shocked after the incident. Find out more here.
Yahoo News Australia
Plea to Aussies after grim discovery along creek: 'More than a ute's worth'
The behaviour seems to be on the rise in Australia.
The Independent
A couple rented an Airbnb from a ‘friendly’ landlord. Then they wound up dismembered in a suitcase
Michael Lee Dudley seemed like a typical Washington homeowner – until three teenagers stumbled across a suitcase of horrors on a beach
HuffPost
Mary Trump Says There's Only 1 Thing 'Scarier' Than Her Uncle Becoming President Again
Donald Trump's niece called on voters to "roll up our sleeves and get to work" to secure democracy.
Yahoo Sport Australia
Ivan Cleary's emphatic call on leaving Panthers amid $13 million development with Nathan
The Panthers coach was adamant when asked about his future. Find out more here.
Yahoo Lifestyle
Kmart shoppers praise 'must-have' $1.75 kitchen gadget: 'A need not a want'
Kmart shoppers are going wild on social media over a brand-new product that’s both practical and adorable. Read more.
Yahoo Sport Australia
Ugly claims around Wayne Bennett worsen as South Sydney continue stunning NRL surge
The Dolphins coach's actions have been called out. Read more here.
The Daily Beast
Ex-Trump Staffer Alleges Campaign Settled Seedy Suits in Bombshell Filing
A thread of bombshell text messages made public Thursday alleged that Donald Trump’s 2020 campaign settled “multiple” seedy lawsuits for a man described only as “Boris,” leading to conjecture it could be a powerful Trumpworld figure.Those messages were revealed by A.J. Delgado—a former staffer on Trump’s 2016 campaign who’s embroiled in a lawsuit against the current campaign, alleging she was taken advantage of and raped by her former superior, Jason Miller.As part of that lawsuit, Delgado, a la
NewsWire
‘Be upfront:’ Albo dents rebel’s claims
A war of words has broken out between a rebel senator and Anthony Albanese, with the PM saying she should “be upfront” on a timeline of events.
The Independent
Royal news live: Prince Harry veteran award backlash continues as William watches Euros 2024 quarter-final
Harry has been defended for his ‘incredible’ work with the Invictus Games
Yahoo News Australia
Driver's 'selfish' car park fail ignites fury among Aussies: ‘Don't belong’
Australians are fed up with drivers of American-style utes behaving badly. Find out why.
Yahoo News Australia
'Crazy' creature found hiding on Aussie beaches stuns: 'Watch your toes'
Aussies and foreigners alike have been stunned to learn what wriggles beneath our toes at the beach.
Yahoo Finance AU
Tax warning as Aussies hit with $7,000 ATO bills: ‘Will happen’
A tax expert explains why you could owe money to the ATO this year.

Latest stories