donghwa-kim.github.io/BLEU.html

 

BLEU Score

BLEU: The BLEU (Bilingual Evaluation Understudy) score is a performance metric used when the input X consists of words that carry order information (a sentence) and the output y is likewise a series of words (a sentence); for models that perform translation ...


 

BLEU is a metric for evaluating translated sentences.

The score is computed from the similarity between the reference (human) translation and the machine translation.

 

1. How much the n-grams overlap

  • 1-grams, 2-grams, 3-grams, and 4-grams are used to compare the machine translation against the reference translation

2. Clipping repeated words so that the score is not overestimated

  • Reference translation : there is a cat on the mat
  • Machine translation : there there there there is
  • Without clipping, the 1-gram precision would be 5 (= machine-translation words found in the reference) / 5 (= length of the machine translation), because every word also occurs in the reference. Crediting each word at most as many times as it occurs in the reference gives 2/5 instead.

3. Comparing the lengths of the reference and machine translations

  • If the machine translation is shorter than the reference, a penalty is applied
  • If the machine translation is longer than the reference, it gains no advantage (the score is simply multiplied by 1)
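The three points above can be sketched in a few lines of Python. This is only a minimal illustration, assuming a single reference sentence and whitespace tokenization; the function names `ngrams`, `ngram_precision`, and `brevity_penalty` are made up for this sketch and do not come from any library.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_precision(candidate, reference, n, clip=True):
    """Fraction of candidate n-grams that also occur in the reference.

    With clip=True, each n-gram is credited at most as many times as it
    occurs in the reference (point 2 above).
    """
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    matched = 0
    for gram, count in cand_counts.items():
        if gram in ref_counts:
            matched += min(count, ref_counts[gram]) if clip else count
    return matched / total


def brevity_penalty(candidate, reference):
    """1 if the candidate is at least as long as the reference, else exp(1 - r/c)."""
    c, r = len(candidate), len(reference)
    if c == 0:
        return 0.0
    return 1.0 if c >= r else math.exp(1 - r / c)


reference = "there is a cat on the mat".split()
candidate = "there there there there is".split()

print(ngram_precision(candidate, reference, 1, clip=False))  # 1.0  (5/5, unclipped)
print(ngram_precision(candidate, reference, 1))              # 0.4  (2/5, clipped)
print(ngram_precision(candidate, reference, 2))              # 0.25 (1/4, only "there is" matches)
print(brevity_penalty(candidate, reference))                 # exp(1 - 7/5), about 0.67
```

For real evaluation an established implementation is the safer choice; this sketch only mirrors the example sentences used here.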

The BLEU score is defined by taking all three of these into account.
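For reference, the usual way of combining the three pieces (as in the original BLEU formulation) can be written as follows. The notation is introduced here and is not from the post: p_n is the clipped n-gram precision, w_n the weight of each n-gram order (typically 1/N with N = 4), c the length of the machine translation, and r the length of the reference.

```latex
\mathrm{BLEU} = \mathrm{BP} \cdot \exp\left( \sum_{n=1}^{N} w_n \log p_n \right),
\qquad
\mathrm{BP} =
\begin{cases}
1 & \text{if } c > r \\
e^{\,1 - r/c} & \text{if } c \le r
\end{cases}
```

Because the precisions enter through a logarithm, a single p_n of zero drives the whole score to zero, which is what happens for the 3-gram and 4-gram precisions of the short example above.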

 

The details are well covered in the blog post linked above, so I will stop here.
