There is a woman sitting at a desk with a laptop and headphones

Meta has created artificial intelligence models capable of recognizing and generating speech in more than 1,000 languages, roughly ten times as many as existing models cover. The company believes that this is a significant step toward preserving languages that are on the verge of extinction. Meta has released the models to the public on GitHub. It says that they will help developers working in different languages to build new speech applications, such as messaging apps that can understand everyone, or virtual reality systems that can be used in any language.

There are about 7,000 languages in the world, but existing speech recognition models cover only about 100 of them. Meta addressed this gap by retraining an existing AI model, developed by the company in 2020, that can learn speech patterns from audio without needing large amounts of labeled data such as transcripts. The model was trained on two new datasets: audio recordings of the New Testament with corresponding text in 1,107 languages, and unlabeled audio recordings of the New Testament in 3,809 languages. The team processed the speech audio and text data to improve its quality, and then ran an algorithm designed to align the audio recordings with the accompanying text. They then repeated this process using a second algorithm trained on the newly aligned data. With this method, the researchers were able to train the algorithm to learn new languages faster, even without accompanying text.

However, the team cautions that the model is still prone to mistranscribing some words or phrases, which can lead to inaccurate or potentially offensive labels. They also acknowledge that their speech recognition models produce more biased words than other models, although only 0.7% more. --auto --s2
Валерий
INFO
Negative Prompts
(nsfw:1.5),verybadimagenegative_v1.3, ng_deepnegative_v1_75t, (ugly face:0.8),cross-eyed,sketches, (worst quality:2), (low quality:2), (normal quality:2), lowres, normal quality, ((monochrome)), ((grayscale)), skin spots, acnes, skin blemishes, bad anatomy, DeepNegative, facing away, tilted head, {Multiple people}, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worstquality, low quality, normal quality, jpegartifacts, signature, watermark, username, blurry, bad feet, cropped, poorly drawn hands, poorly drawn face, mutation, deformed, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, extra fingers, fewer digits, extra limbs, extra arms,extra legs, malformed limbs, fused fingers, too many fingers, long neck, cross-eyed,mutated hands, polar lowres, bad body, bad proportions, gross proportions, text, error, missing fingers, missing arms, missing legs, extra digit, extra arms, extra leg, extra foot, ((repeating hair))
CFG Scale: 7
Steps: 20
Sampler: DPM++ 2M Karras
Seed: -1
Clip Skip: (no value listed)
Image Size: 2048 x 1360
Denoising Strength: 0.2
Model: ReV Animated
Size: 2048 x 1360
Date: Jun 12, 2023
Mode: Default
Type: upscale
Checkpoint & LoRA
Checkpoint: ReV Animated
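
For anyone who wants to record these settings outside the page, the values listed above can be gathered into a small structure. The following is only a sketch: the class name, field names, and the parse_size helper are hypothetical and not part of any SeaArt API, and Clip Skip is left out because the page lists no value for it. A seed of -1 is copied as shown; in many Stable Diffusion front ends it means "pick a random seed", but that meaning is an assumption here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenerationSettings:
    """Settings copied from the page; the field names are illustrative only."""
    cfg_scale: float
    steps: int
    sampler: str
    seed: int  # -1 as listed; assumed (not confirmed) to mean "random seed"
    width: int
    height: int
    denoising_strength: float
    checkpoint: str


def parse_size(text: str) -> tuple[int, int]:
    """Parse a size string such as '2048 X 1360' or '2048X1360' into (width, height)."""
    width, height = text.upper().replace(" ", "").split("X")
    return int(width), int(height)


# Both spellings of the size that appear on the page parse to the same pair.
width, height = parse_size("2048 X 1360")

settings = GenerationSettings(
    cfg_scale=7,
    steps=20,
    sampler="DPM++ 2M Karras",
    seed=-1,
    width=width,
    height=height,
    denoising_strength=0.2,
    checkpoint="ReV Animated",
)
print(settings)
```

The dataclass is frozen so that a recorded set of values cannot be changed by accident after it is created.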