🤗 Code for training ASR models

jp1924/ASR

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Abstract

Problems with existing approaches

  • Bringing a conventional STT system to a usable level of performance requires at least several thousand hours of transcribed speech. Most of the roughly 7,000 languages spoken worldwide cannot meet that requirement.

  • Conventional STT was trained in two stages: the first stage learns quantized representations, and the second stage matches them to the transcribed text. This makes the training pipeline complex and the training itself unstable.

๋…ผ๋ฌธ์ด ์ œ์•ˆํ•˜๋Š” ๋ฐฉ์‹

  • ์ „์‚ฌ๋˜์ง€ ์•Š์€ ์Œ์„ฑ์œผ๋กœ ๋ถ€ํ„ฐ๋„ ์Œ์„ฑ์˜ ํŠน์ง•์„ ํ•™์Šตํ•  ์ˆ˜ ์žˆ๋Š” self supervised ๋ฐฉ์‹์˜ PreTrain ๊ธฐ๋ฒ•์„ ์†Œ๊ฐœํ•จ.
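As a rough illustration of how a model can learn from untranscribed audio, the sketch below masks spans of frames and scores a context vector against the true target and a set of distractors with a contrastive loss. It is a minimal pure-Python sketch, not code from this repository: the function names (`sample_mask`, `contrastive_loss`), the default `temperature`, and the toy vectors are all assumptions chosen for illustration.

```python
import math
import random


def sample_mask(num_frames, mask_prob, span, rng):
    """Pick each frame as a span start with probability `mask_prob`
    and mask `span` consecutive frames from every chosen start."""
    masked = set()
    for start in range(num_frames):
        if rng.random() < mask_prob:
            masked.update(range(start, min(start + span, num_frames)))
    return sorted(masked)


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def contrastive_loss(context, target, distractors, temperature=0.1):
    """Negative log-probability of the true target among target + distractors,
    using temperature-scaled cosine similarity as the score.
    `temperature` is an illustrative default, not a value taken from this repo."""
    logits = [cosine(context, target) / temperature]
    logits += [cosine(context, d) / temperature for d in distractors]
    peak = max(logits)  # subtract the max for numerical stability
    log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_denominator - logits[0]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical settings: 50 frames, 5% start probability, spans of 4 frames.
    print("masked frames:", sample_mask(50, 0.05, 4, rng))
    # Toy vectors: the context matches the target, not the distractor.
    print("loss:", contrastive_loss([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]]))
```

The loss is small when the context vector is closer to the true target than to any distractor, and large otherwise. No transcript is needed, because the targets come from the audio itself.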

Introduction

Model

Training

Masking

Objective
