Meta said the dependence of speech-to-speech translation models on text limits their efficiency. AFP

Meta unveils its first speech-translation system for unwritten languages


Alkesh Sharma

Facebook parent company Meta has released its first speech-to-speech translation system for languages that are primarily spoken and lack a standard written form.

The system was developed under Meta’s Universal Speech Translator (UST) project, which focuses on building artificial intelligence systems that provide speech-to-speech translation across all languages.

Meta's AI researchers built translation systems for the Hokkien language — one of Taiwan’s official languages that is widely spoken within the Chinese diaspora but lacks a standard written form, the company said.

To allow other independent researchers to develop their own speech-to-speech models using Meta's technology, the California-based company has open-sourced the Hokkien translation model and released the data sets and speech matrix.

“Until now, AI translation has focused on written languages. Yet, of the more than 7,000 living languages, more than 40 per cent of languages are primarily oral and do not have a standard or widely known writing system,” Meta said in a blog post.

“We plan to use our Hokkien translation system as part of a universal speech translator and will open source our model, code and training data for the AI community to enable other researchers to build on this work.”

The latest AI-driven technology allows Hokkien speakers to have conversations with people who speak English.

However, the technology can be extended to other unwritten languages and eventually will work in real time, Meta said, adding that more than 8,000 hours of Hokkien speech had been mined, together with the corresponding English translations.

While the model is still a work in progress and can only translate one complete sentence at a time, “it is a step towards a future where simultaneous translation between languages is possible”, Meta said.

“We are releasing the speech matrix, a large corpus of speech-to-speech translations mined with Meta’s innovative data mining technique called Laser, which will enable researchers to create their own speech-to-speech translation systems and build on our work.”

Speech-to-speech translation systems have been developed over the past several years, with top technology companies such as Alphabet and Microsoft rolling out similar products.

Meta faced a number of challenges when developing direct speech-to-speech translation, including data gathering, model design and evaluation.

Most speech translation systems use text as an intermediary step. For example, speech in one language is first converted to text, then translated to text in the desired language and finally input into a text-to-speech system to generate audio.

This makes speech-to-speech translations dependent on text in ways that limit their efficiency and make them difficult to scale to languages that are primarily oral, Meta said.

By contrast, direct speech-to-speech translation models enable the translation of languages that do not have standardised writing systems.

This speech-based approach could lead to faster and more efficient translation systems as they will not require the additional steps of converting speech to text, translating it and then generating speech in the desired language.
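
The difference between the two designs can be sketched in a few lines of Python. This is only an illustration of the pipeline shapes described above, not Meta's model or code: every function name, the `Speech` type and the one-entry toy lexicon are hypothetical stand-ins, and a real system would operate on audio waveforms rather than strings.

```python
from typing import Callable

# Toy stand-in for audio: (language code, utterance). Hypothetical type.
Speech = tuple[str, str]


def cascaded_translate(
    speech: Speech,
    asr: Callable[[Speech], str],
    mt: Callable[[str], str],
    tts: Callable[[str], Speech],
) -> Speech:
    """Text-based pipeline: three stages, each of which needs written text."""
    source_text = asr(speech)      # 1. speech -> text in the source language
    target_text = mt(source_text)  # 2. source text -> target text
    return tts(target_text)        # 3. target text -> speech


def direct_translate(speech: Speech, s2st: Callable[[Speech], Speech]) -> Speech:
    """Direct pipeline: one model maps speech to speech, with no text step."""
    return s2st(speech)


# Illustrative stubs only; they make the sketch runnable and nothing more.
TOY_LEXICON = {"good morning": "buenos dias"}


def toy_asr(speech: Speech) -> str:
    return speech[1]


def toy_mt(text: str) -> str:
    return TOY_LEXICON[text]


def toy_tts(text: str) -> Speech:
    return ("es", text)


def toy_s2st(speech: Speech) -> Speech:
    return ("es", TOY_LEXICON[speech[1]])


if __name__ == "__main__":
    utterance = ("en", "good morning")
    print(cascaded_translate(utterance, toy_asr, toy_mt, toy_tts))
    print(direct_translate(utterance, toy_s2st))
```

In the cascaded version, the first and last stages cannot be built for a language without a standard written form, which is the scaling limit Meta describes; the direct version has no such dependency.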

Spoken communications can also help to break down barriers and bring people together wherever they are located — even in the metaverse, Meta said.

“AI research is helping to break down language barriers — both in the real world and the metaverse.

“In the future, all languages, whether written or unwritten, may no longer be an obstacle to mutual understanding. We look forward to contributing to this future of seamless communication,” the company said.

The metaverse is a digital space that allows users to communicate and move virtually in their three-dimensional avatars or digital representations.

Described as a successor to the internet, it is a set of immersive spaces shared by users, in which they can interact, innovate and engage other people who are not in the same physical location.

Updated: October 26, 2022, 7:16 AM