Topic 1 Question 102
A company has developed a generative text summarization model by using Amazon Bedrock. The company will use Amazon Bedrock automatic model evaluation capabilities.
Which metric should the company use to evaluate the accuracy of the model?
Area Under the ROC Curve (AUC) score
F1 score
BERTScore
Real world knowledge (RWK) score
ユーザの投票
コメント(4)
- 正解だと思う選択肢: C
BERTScore is a metric specifically designed to evaluate text generation tasks, such as summarization. It measures the semantic similarity between the generated text and the reference text by leveraging contextual embeddings from pre-trained models like BERT.
BERTScore captures deeper semantic relationships, making it ideal for evaluating the accuracy and meaningfulness of summaries.
👍 1ap64912024/12/27 - 正解だと思う選択肢: C
BERTScore is the most appropriate metric for evaluating the accuracy of a generative text summarization model because it compares semantic similarity in a manner that aligns well with the goal of text summarization.
👍 1aws_Tamilan2024/12/27 - 正解だと思う選択肢: C
The correct answer is C. BERTScore is specifically designed for evaluating text generation quality.
👍 1may2021_r2024/12/28
シャッフルモード