Topic 1 Question 280
You are running a streaming pipeline with Dataflow and are using hopping windows to group the data as the data arrives. You noticed that some data is arriving late but is not being marked as late data, which is resulting in inaccurate aggregations downstream. You need to find a solution that allows you to capture the late data in the appropriate window. What should you do?
Use watermarks to define the expected data arrival window. Allow late data as it arrives.
Change your windowing function to tumbling windows to avoid overlapping window periods.
Change your windowing function to session windows to define your windows based on certain activity.
Expand your hopping window so that the late data has more time to arrive within the grouping.
ユーザの投票
コメント(3)
- 正解だと思う選択肢: A
A. Use watermarks to define the expected data arrival window. Allow late data as it arrives.
👍 1scaenruy2024/01/03 - 正解だと思う選択肢: A👍 1Sofiia982024/01/09
- 正解だと思う選択肢: A
- Watermarks: Watermarks in a streaming pipeline are used to specify the point in time when Dataflow expects all data up to that point to have arrived.
- Allow Late Data: configure the pipeline to accept and correctly process data that arrives after the watermark, ensuring it's captured in the appropriate window.
👍 1raaad2024/01/09
シャッフルモード