Topic 1 Question 48
An ML engineer is using a training job to fine-tune a deep learning model in Amazon SageMaker Studio. The ML engineer previously used the same pre-trained model with a similar dataset. The ML engineer expects vanishing gradient, underutilized GPU, and overfitting problems. The ML engineer needs to implement a solution to detect these issues and to react in predefined ways when the issues occur. The solution also must provide comprehensive real-time metrics during the training. Which solution will meet these requirements with the LEAST operational overhead?
Use TensorBoard to monitor the training job. Publish the findings to an Amazon Simple Notification Service (Amazon SNS) topic. Create an AWS Lambda function to consume the findings and to initiate the predefined actions.
Use Amazon CloudWatch default metrics to gain insights about the training job. Use the metrics to invoke an AWS Lambda function to initiate the predefined actions.
Expand the metrics in Amazon CloudWatch to include the gradients in each training step. Use the metrics to invoke an AWS Lambda function to initiate the predefined actions.
Use SageMaker Debugger built-in rules to monitor the training job. Configure the rules to initiate the predefined actions.
ユーザの投票
コメント(2)
- 正解だと思う選択肢: D👍 2GiorgioGss2024/11/27
- 正解だと思う選択肢: D
SageMaker Debugger → Built-in rules → Monitor training (vanishing gradients, GPU use, overfitting) → Predefined actions → Low overhead
👍 2Saransundar2024/12/05
シャッフルモード