You are working on a project that involves training a multimodal AI model combining natural language
processing and computer vision to provide real-time captions for live video feeds. However, the captions generated often lag behind the video. Upon investigation, you find that the language model is causing the delay. Which of the following strategies is most likely to reduce the lag and improve synchronization?
Correct Answer: D
Explanation:
Switching to a smaller language model reduces processing time, minimizing the lag between the video
feed and the generated captions.
Question 2
You are developing a multimodal AI system that generates high-resolution images from complex English
text prompts. The system must handle a variety of detailed descriptions and produce visually accurate
results. What strategies are most effective for optimizing the performance of a text-t
Correct Answer: B, C
Explanation:
Using a diverse set of image-text pairs improves the model’s ability to handle a wide range of prompts,
and transformer-based models are effective for encoding complex text descriptions.
Question 3
You are optimizing a generative AI model's performance by tuning hyperparameters. Which of the following strategies is most likely to improve the model's training efficiency without compromising its ability to generalize?
Correct Answer: D
Explanation:
Applying regularization techniques like L2 regularization or dropout helps prevent overfitting, improving
the model's generalization ability while maintaining training efficiency.
Demo Practice Mode
You are viewing only the questions marked as Demo.