Topic 1 Question 232

Professional Machine Learning Engineer

Topic 1 Question 232
You need to use TensorFlow to train an image classification model. Your dataset is located in a Cloud Storage directory and contains millions of labeled images. Before training the model, you need to prepare the data. You want the data preprocessing and model training workflow to be as efficient, scalable, and low maintenance as possible. What should you do?
- 1. Create a Dataflow job that creates sharded TFRecord files in a Cloud Storage directory.
  2. Reference tf.data.TFRecordDataset in the training script.
  3. Train the model by using Vertex AI Training with a V100 GPU.
- 1. Create a Dataflow job that moves the images into multiple Cloud Storage directories, where each directory is named according to the corresponding label
  2. Reference tfds.folder_dataset:ImageFolder in the training script.
  3. Train the model by using Vertex AI Training with a V100 GPU.
- 1. Create a Jupyter notebook that uses an nt-standard-64 V100 GPU Vertex AI Workbench instance.
  2. Write a Python script that creates sharded TFRecord files in a directory inside the instance.
  3. Reference tf.data.TFRecordDataset in the training script.
  4. Train the model by using the Workbench instance.
- 1. Create a Jupyter notebook that uses an n1-standard-64, V100 GPU Vertex AI Workbench instance.
  2. Write a Python script that copies the images into multiple Cloud Storage directories, where each. directory is named according to the corresponding label.
  3. Reference tfds.foladr_dataset.ImageFolder in the training script.
  4. Train the model by using the Workbench instance.
ユーザの投票
コメント(4)
- 正解だと思う選択肢: A
  millions of labeled images -> dataflow tfrecord faster than folder-based
  
  👍 7
  pinimichele012024/04/08
- 正解だと思う選択肢: A
  Ideally you want to export your data in TFRecords (most efficient image format) in Cloud Storage, and not in the instance (to improve scalability)
  
  👍 3
  b1a8fae2024/01/17
- 正解だと思う選択肢: A
  B. Folder-Based Structure: While viable, it's less efficient for large datasets compared to TFRecord files, potentially leading to slower I/O during training. C. Workbench Processing: Local preprocessing on a single instance can be less scalable and efficient for millions of images, potentially introducing bottlenecks. D. Workbench Training: While Workbench offers a Jupyter environment, Vertex AI Training is specifically designed for scalable model training, providing optimized hardware and infrastructure.
  
  👍 2
  pikachu0072024/01/12
シャッフルモード

ユーザの投票

コメント(4)