Examtopics

AWS Certified AI Practitioner
  • Topic 1 Question 88

    A social media company wants to use a large language model (LLM) to summarize messages. The company has chosen a few LLMs that are available on Amazon SageMaker JumpStart. The company wants to compare the generated output toxicity of these models.

    Which strategy gives the company the ability to evaluate the LLMs with the LEAST operational overhead?

    • Crowd-sourced evaluation

    • Automatic model evaluation

    • Model evaluation with human workers

    • Reinforcement learning from human feedback (RLHF)


    シャッフルモード