Examtopics

AWS Certified AI Practitioner
  • Topic 1 Question 79

    A company has built a solution by using generative AI. The solution uses large language models (LLMs) to translate training manuals from English into other languages. The company wants to evaluate the accuracy of the solution by examining the text generated for the manuals. Which model evaluation strategy meets these requirements?

    • Bilingual Evaluation Understudy (BLEU)

    • Root mean squared error (RMSE)

    • Recall-Oriented Understudy for Gisting Evaluation (ROUGE)

    • F1 score
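    For context on the first option: BLEU was designed for scoring machine translation output by measuring n-gram overlap between a generated translation and a human reference translation. The following is a minimal sketch of that idea, not a reference implementation. It assumes whitespace tokenization, a single reference, and n-grams up to length 2; the function names (`ngrams`, `modified_precision`, `simple_bleu`) are illustrative, and real evaluations would normally use an established library and multiple references.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision: each candidate n-gram counts at most
    as many times as it appears in the reference."""
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total


def simple_bleu(candidate, reference, max_n=2):
    """Simplified BLEU: geometric mean of clipped precisions for
    n = 1..max_n, multiplied by a brevity penalty (assumption: one
    reference, no smoothing)."""
    precisions = [modified_precision(candidate, reference, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0
    log_mean = sum(math.log(p) for p in precisions) / max_n
    # Brevity penalty discourages translations shorter than the reference.
    if len(candidate) > len(reference):
        bp = 1.0
    else:
        bp = math.exp(1 - len(reference) / len(candidate))
    return bp * math.exp(log_mean)


# Hypothetical example sentences, tokenized on whitespace.
reference = "press the red button".split()
candidate = "press the green button".split()
score = simple_bleu(candidate, reference)
```

    In this sketch, an exact match with the reference scores 1.0 and a translation that shares no words with it scores 0.0; partial overlap falls in between.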

