A group of girls with headphones and a man with a backpack

Meta has created artificial intelligence models capable of recognizing and producing speech in more than 1,000 languages, about ten times as many as existing models cover. The company believes that this is a significant step toward preserving languages that are on the verge of extinction. Meta has released its models to the public on GitHub. The company says this will help developers working in different languages to create new speech applications, such as instant messengers that can understand everyone, or virtual reality systems that can be used in any language. There are about 7,000 languages in the world, but existing speech recognition models cover only about 100 of them. Meta addressed this problem by retraining an existing AI model, developed by the company in 2020, that can learn speech patterns from audio without the need for large amounts of labeled data such as transcripts. The model was trained on two new datasets: audio recordings of the New Testament of the Bible with corresponding text in 1,107 languages, and unlabeled audio recordings of the New Testament in 3,809 languages. The team processed the speech audio and text data to improve its quality, and then ran an algorithm designed to align the audio recordings with the accompanying text. After that, they repeated this process using a second algorithm trained on the newly aligned data. With this method, the researchers were able to train the model to learn new languages faster, even without accompanying text. However, the team cautions that the model is still prone to errors in the transcription of some words or phrases, which can lead to inaccurate or potentially offensive labels. They also admit that their speech recognition models produce more biased words than other models, although only 0.7% more. --auto --s2
Валерий
Generation Info
Size
3072X2048
Date
Jun 12, 2023
Mode
Default
Type
upscale
Checkpoint & LoRA
Checkpoint
ReV Animated