banner_image ×
SeaArt AI Empresa

A group of girls with headphones and a man with a backpack

Meta has created artificial intelligence models capable of recognizing and delivering speech in more than 1,000 languages, which is ten times more than currently. The company believes that this is a significant step in preserving languages that are on the verge of extinction. Meta has released its models to the public on GitHub. She claims that such a solution will help developers working in different languages to create new speech applications, such as instant messengers that can understand everyone, or virtual reality systems that can be used in any language. There are about 7,000 languages in the world, but existing speech recognition models cover only about 100 of them. Meta solved this problem by retraining an existing AI model developed by the company in 2020 that can learn speech patterns from audio without the need for large amounts of labeled data such as transcripts. The model was trained on two new datasets: audio recordings of the New Testament of the Bible and corresponding text in 1,107 languages, as well as unlabeled audio recordings of the New Testament in 3,809 languages. The team processed the speech audio and text data to improve its quality, and then ran an algorithm designed to align the audio recordings with the accompanying text. After that, they repeated this process using a second algorithm trained on the new aligned data. With this method, the researchers were able to train the algorithm to learn new languages faster, even without accompanying text. However, the team cautions that the model is still prone to errors in the transcription of some words or phrases, which can lead to inaccurate or potentially offensive labels. They also admit that their speech recognition models produce more biased words than other models, although only 0.7% more. --auto --s2
chatIcon
Algunas cosas solo deben saberlas tú y yo.
Crear personaje de IA
image

Meta has created artificial intelligence models capable of recognizing and delivering speech in more than 1,000 languages, which is ten times more than currently. The company believes that this is a significant step in preserving languages that are on the verge of extinction. Meta has released its models to the public on GitHub. She claims that such a solution will help developers working in different languages to create new speech applications, such as instant messengers that can understand everyone, or virtual reality systems that can be used in any language. There are about 7,000 languages in the world, but existing speech recognition models cover only about 100 of them. Meta solved this problem by retraining an existing AI model developed by the company in 2020 that can learn speech patterns from audio without the need for large amounts of labeled data such as transcripts. The model was trained on two new datasets: audio recordings of the New Testament of the Bible and corresponding text in 1,107 languages, as well as unlabeled audio recordings of the New Testament in 3,809 languages. The team processed the speech audio and text data to improve its quality, and then ran an algorithm designed to align the audio recordings with the accompanying text. After that, they repeated this process using a second algorithm trained on the new aligned data. With this method, the researchers were able to train the algorithm to learn new languages faster, even without accompanying text. However, the team cautions that the model is still prone to errors in the transcription of some words or phrases, which can lead to inaccurate or potentially offensive labels. They also admit that their speech recognition models produce more biased words than other models, although only 0.7% more. --auto --s2

avatar
В
Валерий
Prompts
Copiar prompts
Meta has created artificial intelligence models capable of recognizing and delivering speech in more than 1 , 000 languages , which is ten times more than currently . The company believes that this is a significant step in preserving languages that are on the verge of extinction . Meta has released its models to the public on GitHub . She claims that such a solution will help developers working in different languages to create new speech applications , such as instant messengers that can understand everyone , or virtual reality systems that can be used in any language . There are about 7 , 000 languages in the world , but existing speech recognition models cover only about 100 of them . Meta solved this problem by retraining an existing AI model developed by the company in 2020 that can learn speech patterns from audio without the need for large amounts of labeled data such as transcripts . The model was trained on two new datasets: audio recordings of the New Testament of the Bible and corresponding text in 1 , 107 languages , as well as unlabeled audio recordings of the New Testament in 3 , 809 languages . The team processed the speech audio and text data to improve its quality , and then ran an algorithm designed to align the audio recordings with the accompanying text . After that , they repeated this process using a second algorithm trained on the new aligned data . With this method , the researchers were able to train the algorithm to learn new languages faster , even without accompanying text . However , the team cautions that the model is still prone to errors in the transcription of some words or phrases , which can lead to inaccurate or potentially offensive labels . They also admit that their speech recognition models produce more biased words than other models , although only 0 . 7% more . --auto --s2
INFO
Prompts
Meta has created artificial intelligence models capable of recognizing and delivering speech in more than 1,000 languages, which is ten times more than currently. The company believes that this is a significant step in preserving languages that are on the verge of extinction. Meta has released its models to the public on GitHub. She claims that such a solution will help developers working in different languages to create new speech applications, such as instant messengers that can understand everyone, or virtual reality systems that can be used in any language. There are about 7,000 languages in the world, but existing speech recognition models cover only about 100 of them. Meta solved this problem by retraining an existing AI model developed by the company in 2020 that can learn speech patterns from audio without the need for large amounts of labeled data such as transcripts. The model was trained on two new datasets: audio recordings of the New Testament of the Bible and corresponding text in 1,107 languages, as well as unlabeled audio recordings of the New Testament in 3,809 languages. The team processed the speech audio and text data to improve its quality, and then ran an algorithm designed to align the audio recordings with the accompanying text. After that, they repeated this process using a second algorithm trained on the new aligned data. With this method, the researchers were able to train the algorithm to learn new languages faster, even without accompanying text. However, the team cautions that the model is still prone to errors in the transcription of some words or phrases, which can lead to inaccurate or potentially offensive labels. They also admit that their speech recognition models produce more biased words than other models, although only 0.7% more. --auto --s2
Etiqueta negativa
(nsfw:1.5),verybadimagenegative_v1.3, ng_deepnegative_v1_75t, (ugly face:0.8),cross-eyed,sketches, (worst quality:2), (low quality:2), (normal quality:2), lowres, normal quality, ((monochrome)), ((grayscale)), skin spots, acnes, skin blemishes, bad anatomy, DeepNegative, facing away, tilted head, {Multiple people}, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worstquality, low quality, normal quality, jpegartifacts, signature, watermark, username, blurry, bad feet, cropped, poorly drawn hands, poorly drawn face, mutation, deformed, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, extra fingers, fewer digits, extra limbs, extra arms,extra legs, malformed limbs, fused fingers, too many fingers, long neck, cross-eyed,mutated hands, polar lowres, bad body, bad proportions, gross proportions, text, error, missing fingers, missing arms, missing legs, extra digit, extra arms, extra leg, extra foot, ((repeating hair))
Escala CFG
7
Pasos
20
Recolector
DPM++ 2M Karras
Semilla
-1
Clip Skip
Tamaño de imagen
3072 X 2048
Modelo
ReV Animated
Generar
Tamaño
3072X2048
Fecha
Jun 12, 2023
Modo
Por defecto
Tipo
upscale
Checkpoint & LoRA
ReV Animated
Checkpoint
ReV Animated
0 comentario(s)
0
2
0

Apps de AI Rápido de SeaArt

ai_video_generationimg
Generación de videos con IA

Libera tu imaginación, la IA creará maravillas visuales para ti

face_swap_titleimg
Cambio de cara en línea gratis

Crea rápidamente videos y fotos de cambio de cara divertidos y realistas

vrtry_cloth_h1img
Prueba virtual de ropa

Prueba cualquier tipo de ropa virtualmente con IA

cartoon_avatar_h1img
Creador de avatares de dibujos animados

Convierte tus fotos en avatares de dibujos animados únicos al instante.

kiss_vidimg
Generador de videos de besos con IA

Únete a la tendencia de los besos con el generador de videos de besos AI de SeaArt al instante. Facilita que dos personas se besen y crea una animación realista.

video_face_swapimg
Intercambio de caras en video

Crea videos divertidos intercambiando caras en cualquier clip de video.

Explora más aplicaciones AI 

Recomendaciones relacionadas

ControlNet
avatar
S
avatar_frame
Solei
1
1
ControlNet
avatar
ル
ルックルック
0
0
ControlNet
avatar
M
MO8S4E
0
2
ControlNet
avatar
C
Chicken Mo
0
0
ControlNet
avatar
千
avatar_frame
千尋
0
1
ControlNet
avatar
K
Kirito
0
0
ControlNet
avatar
三
三芳健吾
0
0
ControlNet
avatar
令
令赋
0
0
ControlNet
avatar
P
Popa VL
1
1
ControlNet
avatar
A
autumn
0
3
ControlNet
avatar
F
fitCorder
0
0
ControlNet
avatar
B
beny_07k
0
0
ControlNet
avatar
M
miyuka.yz
0
0
ControlNet
avatar
I
Ian Gabriel Guerrero Loor
0
0
ControlNet
avatar
M
mahdy maram
0
0
ControlNet
avatar
A
Adelia Rocália
0
0
ControlNet
avatar
P
pawapawa
0
0
ControlNet
avatar
E
Eduardo Lopez
0
0
ControlNet
avatar
プ
avatar_frame
プロの酔っ払い
0
0
ControlNet
avatar
タ
タンタム
0
1
ControlNet
avatar
F
Fauzanna Kamilani
0
0
ControlNet
avatar
キ
キリ氏。
0
0
ControlNet
avatar
M
Ms.song
0
1
ControlNet
avatar
M
miguel.escamez
0
1
ControlNet
avatar
土
土匪老大
0
0
ControlNet
avatar
风
风向标
1
0
ControlNet
avatar
メ
メンドーサ
0
0
ControlNet
avatar
E
Ericson Otayza
0
3
ControlNet
avatar
J
Johana Jiménez
0
0
ControlNet
avatar
G
Gugelwan Yutup
0
0
ControlNet
avatar
N
avatar_frame
Number Shot
1
0
ControlNet
avatar
I
INFO FAKTA
0
0
ControlNet
avatar
S
SOUMEN BAG
0
0
ControlNet
avatar
D
dyna nut
0
0
ControlNet
logo
Español
Aplicación
Crear imagen Personajes AI Swift AI Entrenamiento de modelos Canvas Aplicación rápida Flujo de trabajo
Sobre él/ella
Estudio Clasificación Chat IA AI blog AI noticias
Ayuda
Guías Servicio al cliente
Obtener aplicación
icon
Download on the
APP Store
icon
GET IT ON
Google Play
Síguenos
iconiconiconiconiconiconicon
© 2025 SeaArt, Inc.
Copyright Policy
Términos
Privacidad 特定商取引法 資金決済法に基づく表示
Más