Amazon Practice Questions, Discussions & Exam Topics by our Authors
A banking company provides financial products to customers around the world. A machine learning (ML) specialist collected transaction data from internal customers. The ML specialist split the dataset into training, testing, and validation datasets. The ML specialist analyzed the training dataset by using Amazon SageMaker Clarify. The analysis found that the training dataset contained fewer example...
To address this scenario, we need to identify the type of pretraining bias observed by the ML specialist in the training dataset. The specialist noticed that there were fewer examples of customers in the 40 to 55-year-old age group compared to other age groups. This indicates a possible bias related to how different demographic groups are represented in the dataset.
Key Factors to Consider:
- Training Dataset Analysis: The dataset analysis revealed that one particular age group (40-55 years old) had fewer examples compared to other age groups.
- Bias in the Dataset: The imbalance is demographic in nature, and this needs to be addressed to ensure fairness and accuracy in the model.
Let's examine the options:
A) Difference in proportions of labels (DPL)
- Description: DPL occurs when the proportions of different classes (or labels) in the dataset differ significantly from the expected distribution. This bias is typically observed in classification problems where there is a significant difference in the class distribution.
- Why it's not correct: The scenario here doesn't involve an imbalance between class labels (such as "fraud" vs. "non-fraud" transactions), but rather a bias in the representation of different demographic groups (specifically age groups). Therefore, this option does not align with the observed issue.
B) Class imbalance (CI)
- Description: Class imbalance happens when there are significantly more instances of some classes than others in the dataset. This typically occurs in classification tasks where the model is trained on imbalanced target labels, leading to poor model performance on the minority class.
- Why it's not correct: While the scenario does describe an imbalance, it specifically involves age groups...
Author: Andrew · Last updated Apr 3, 2026
A tourism company uses a machine learning (ML) model to make recommendations to customers. The company uses an Amazon SageMaker environment and set hyperparameter tuning completion criteria to MaxNumberOfTrainingJobs.
An ML specialist wants to change the hyperparameter tuning completion criteria. The ML specialist wants to stop tuning immediately after an internal algorithm determi...
In this scenario, the ML specialist wants to stop the hyperparameter tuning jobs when the algorithm determines that the tuning job is unlikely to improve the objective metric by more than 1% over the best training job. This requires a criterion that will monitor the improvement in the objective metric and stop the tuning process when further improvements are deemed unlikely.
A) MaxRuntimeInSeconds
- Use case: This criterion allows you to set the maximum runtime for the entire hyperparameter tuning job. The job will stop after this time limit, regardless of the model’s performance or convergence.
- Advantages: It’s useful when you want to set a time limit on how long the tuning process can run.
- Disadvantages: This doesn’t address the need to stop based on the metric improvement; instead, it focuses on the total time. It doesn’t ensure stopping when further improvement is unlikely.
- Not Ideal: This option does not meet the requirement, as it doesn’t take the improvement in the objective metric into account.
B) TargetObjectiveMetricValue
- Use case: This criterion sets a specific target value for the objective metric. The tuning job will stop once a model reaches or exceeds this value.
- Advantages: Useful when you have a specific threshold of performance in mind and want the tuning job to stop when this threshold is met.
- Disadvantages: This doesn’t consider improvements beyond the target value. The ML specialist wants to stop tuning based on the relative improvement in the objective metric (whether the improvement from the best job is likely to exceed 1%).
- Not Ideal: This criterion is about stopping when a certain performance level is achieved, not about detecting diminishing returns in improvement.
C) CompleteOnConvergence
- Use case: This completion criterion stops the tuning job when further improvements to the objective metric are unlikely. It is based on con...
Author: Ming88 · Last updated Apr 3, 2026
A car company has dealership locations in multiple cities. The company uses a machine learning (ML) recommendation system to market cars to its customers.
An ML engineer trained the ML recommendation model on a dataset that includes multiple attributes about each car. The dataset includes attributes such as car brand, car type, fuel efficiency, and price.
The ML engineer uses Amazon SageMaker Data Wrangler to analyze and visual...
To analyze the distribution of car prices for a specific type of car, the ML engineer needs to identify how the prices vary and the range of values for that car type. The best approach is to visualize the distribution of the car prices specifically for the car type in question.
Option A: Scatter Plot
A scatter plot is useful for visualizing the relationship between two continuous variables, such as car price and another continuous attribute (like fuel efficiency). While a scatter plot could show how price correlates with another feature, it does not directly provide insights into the distribution of car prices within a single type of car. It would be more useful if the goal was to understand how price and another continuous feature vary together.
Rejected Reason: This is not the best option because the goal is to analyze the distribution of car prices for a specific type of car, not to explore relationships between variables.
Option B: Quick Model
The quick model visualization in SageMaker is used to evaluate machine learning models quickly and generate importance scores for different features, such as identifying how much each feature contributes to model predictions. This option is not directly related to visualizing the distribution of a specific feature like car prices for a particular car type.
Rejected Reason: It is used for evaluating model performance and feature importan...
Author: Henry · Last updated Apr 3, 2026
A media company is building a computer vision model to analyze images that are on social media. The model consists of CNNs that the company trained by using images that the company stores in Amazon S3. The company used an Amazon SageMaker training job in File mode with a single Amazon EC2 On-Demand Instance.
Every day, the company updates the model by using about 10,000 images that the company has collected in the last 24 hours. The com...
To meet the company's goal of speeding up training and lowering costs without changing the code, we need to consider options that maximize efficiency and minimize costs. Here’s an analysis of the options:
Option A: Configure to use Pipe mode instead of File mode
Pipe mode allows data to be streamed directly into the training process, avoiding the need to first store all the data in S3 before it can be read. This is especially useful for large datasets because it speeds up the data ingestion process, making training faster.
Advantages:
- Streaming data in real time can reduce the latency of reading from disk, enabling faster training because the model can start processing the data as it is ingested.
- Pipe mode is generally more efficient when the dataset is large or continuously updated, as in the case with the 10,000 new images being added daily.
Selected Reason: This would meet both the speed and cost requirements because it would allow faster ingestion of images and reduce the time needed for training.
Rejected Reason: None. This is a strong option for the scenario where the company needs to speed up training.
Option B: Configure to use FastFile mode
FastFile mode provides an optimized file reading method that allows faster training compared to traditional file reading methods. However, it is most beneficial when the dataset is static or not changing very frequently. Since the company is updating the dataset daily with new images, using FastFile mode wouldn’t necessarily provide the expected improvement because it works best with a more static dataset.
Rejected Reason: Since the dataset is frequently updated, FastFile mode might not be ideal for this use case.
Option C: Configure to use Spot Instances
Spot Instances are a cost-saving solution compared to On-Demand Instanc...
Author: MoonlitPantherX · Last updated Apr 3, 2026
A telecommunications company has deployed a machine learning model using Amazon SageMaker. The model identifies customers who are likely to cancel their contract when calling customer service. These customers are then directed to a specialist service team. The model has been trained on historical data from multiple years relating to customer contracts and customer service interactions in a single geographic region.
The company is planning to launch a new global product that will use this model. Management...
The telecommunications company is launching a global product, and management is concerned that the model might incorrectly direct calls from customers in regions without historical data to the specialist service team. This indicates that there is a potential risk that the model could perform poorly on new data from regions it hasn’t seen before, since it was trained only on historical data from a single geographic region.
Approach Breakdown:
Option A: Enable SageMaker Model Monitor with Data Capture
Model Monitor in SageMaker helps you track the performance of your deployed model over time. By capturing data and monitoring it against a baseline, you can detect if the distribution of incoming data shifts from the distribution the model was trained on.
How it Works:
- The monitoring baseline is created based on the training data.
- It checks if the numerical distance (drift) of regional customer data deviates from the baseline.
- CloudWatch alerts the data scientists when drift is detected, allowing them to evaluate and retrain the model with a larger, more diverse dataset.
Advantages:
- Detects data drift (distribution shift) between historical training data and the new global data, which is a key concern since the model was trained on a single region.
- Allows early detection of model performance issues as new regions use the model.
- Helps ensure that the model adapts and improves as new data from diverse regions is integrated.
Rejected Reason: None. This is a comprehensive solution for detecting model issues caused by the introduction of global customer data.
Option B: Enable SageMaker Debugger with Custom Rule for Baseline Variance
SageMaker Debugger provides real-time insights into model training. You can use it to create custom rules to monitor certain conditions during training or inference, such as variance from the baseline training dataset.
How it Works:
- A custom rule is created to track variance from the baseline training data.
- Alerts are generated via CloudWatch when the rule is triggered.
Rejected Reason: This option is more suited for debugging the mod...
Author: Rahul · Last updated Apr 3, 2026
A machine learning (ML) engineer is creating a binary classification model. The ML engineer will use the model in a highly sensitive environment.
There is no cost associated with missing a positive label. However, the cost of making a false positive ...
In a binary classification model where the cost of making a false positive inference is extremely high, the most important metric to optimize for is Precision.
Reasoning:
Precision measures the proportion of true positive predictions out of all the positive predictions made by the model (i.e., how many of the predicted positives are actually correct). In this scenario, false positives are very costly, so it is crucial to minimize them. High precision ensures that when the model predicts a positive outcome, it is very likely to be correct, thus minimizing the risk of a false positive.
Breakdown of Other Options:
1. A) Accuracy:
- Accuracy measures the overall correctness of the model (the proportion of true positives and true negatives out of all predictions). However, accuracy does not differentiate between the types of errors the model makes. In a situation where false positives are highly costly and false negatives are not as critical, accuracy is not the right metric to prioritize. It could be misleading because a model could have high accuracy by simply predicting the majority class or by avoiding predicting positives altogether, but still have many costly false positives.
Rejected Reason: Accuracy does not prioritize minimizing false positives and is not tailored to this scenario where the cost of false positives is extremely high.
2. B) Precision:
- Precision is the proportion of positive predictions that are actually correct. Since false positives are extremely costly in this case, high precision ensures that when the model predicts a positive outcome, it is more likely to be accurate. It directly minimize...
Author: Liam · Last updated Apr 3, 2026
An ecommerce company discovers that the search tool for the company's website is not presenting the top search results to customers. The company needs to resolve the issue so the search tool will present results that customers are most like...
The eCommerce company wants to resolve the issue where the search tool is not presenting the top results customers are most likely to purchase. The goal is to provide relevant search results with the least operational effort. Let's evaluate each option:
Option A: Use Amazon SageMaker BlazingText for Query Expansion
BlazingText is a powerful algorithm in Amazon SageMaker used for natural language processing (NLP) tasks such as text classification and word embeddings. It could potentially enhance search results by adding context to search queries through techniques like query expansion (e.g., suggesting related or similar terms). However, using this method requires significant setup for model training and fine-tuning, as well as continuous maintenance of the model.
Rejected Reason: Although BlazingText could improve the relevance of search results through query expansion, it requires custom model development and ongoing operational effort. It is more complex and time-consuming compared to using an out-of-the-box solution like Amazon CloudSearch.
Option B: Use Amazon SageMaker XGBoost to Improve Candidate Ranking
XGBoost is a popular machine learning algorithm used for supervised learning tasks, including classification and regression. It could potentially be used to rank search results based on features like past purchase behavior, customer preferences, etc. However, this would involve creating and training a custom model, managing features, and maintaining the model over time.
Rejected Reason: While XGBoost could improve ranking based on various features, it requires significant data preparation, training, and ongoing monitoring. It introduces more operational complexity compared to simpler solutions.
Option C: Use Amazon CloudSearch and Sort Results by Search Relevance Score
Amazon CloudSearch is a fully...
Author: CrystalWolfX · Last updated Apr 3, 2026
A machine learning (ML) specialist collected daily product usage data for a group of customers. The ML specialist appended customer metadata such as age and gender from an external data source.
The ML specialist wants to understand product usage patterns for each day of the week for customers in specific age groups. The ML specialist creates two categorical features ...
To determine the relationship between two categorical variables, dayofweek and binned_age, it's important to choose a method that highlights the interaction between these variables. Let's break down the options and their relevance:
A) Create a scatterplot for day_of_week and binned_age:
- Rejection Reason: A scatterplot is typically used for visualizing relationships between continuous variables, not categorical ones. Since both dayofweek (days of the week) and binned_age (age groups) are categorical features, a scatterplot wouldn't be the best way to reveal patterns or relationships between them.
B) Create crosstabs for day_of_week and binned_age:
- Selected Option: A crosstab (or contingency table) is the most appropriate method for discovering relationships between two categorical variables. It will display how frequently each combination of categories occurs (i.e., how many customers in each age group use the product on each day of the week). This can help the ML specialist identify any patterns or trends in product usage relative to both the day of the week and the binned age group.
C) Create word clouds for day_of_week and binned_age:
- Rej...
Author: FrozenWolf2022 · Last updated Apr 3, 2026
A company needs to develop a model that uses a machine learning (ML) model for risk analysis. An ML engineer needs to evaluate the contribution each feature of a training dataset makes to the prediction of the target variable before the...
To evaluate the contribution of each feature in the prediction of the target variable for a risk analysis model, the ML engineer needs a method that quantifies how important each feature is to the model's prediction. Let's analyze each option:
A) Use the Amazon SageMaker Data Wrangler multicollinearity measurement features and the principal component analysis (PCA) algorithm to calculate the variance of the dataset along multiple directions in the feature space:
- Rejection Reason: While PCA is useful for dimensionality reduction and understanding the variance in the feature space, it does not directly provide information about the contribution of individual features to the target variable. PCA focuses on transforming features to capture maximum variance, not on evaluating feature importance for prediction purposes. Multicollinearity measurement identifies correlated features, but this is not the same as evaluating feature contributions to the model's predictions.
B) Use an Amazon SageMaker Data Wrangler quick model visualization to find feature importance scores that are between 0.5 and 1:
- Selected Option: This approach directly addresses the need to evaluate the contribution of each feature to the model’s predictions. Amazon SageMaker provides built-in tools to visualize model outcomes and feature importance scores. These scores indicate the relative importance of each feature in making predictions, which is exactly what the ML engineer needs to assess before selecting the most relevant features for th...
Author: Abigail · Last updated Apr 3, 2026
A company is building a predictive maintenance system using real-time data from devices on remote sites. There is no AWS Direct Connect connection or VPN connection between the sites and the company's VPC. The data needs to be ingested in real time from the devices into Amazon S3.
Transformation is needed to convert the raw data into clean .csv data to be fed into the machine learning (ML) model. The transformation needs to happen during the ingestion process. When transformation fails, the records need to be stored in ...
To design a solution that meets the company's requirements for real-time data ingestion, transformation, backup, and storage, we need to consider the following constraints:
1. Real-Time Data Ingestion: The data needs to be ingested in real-time.
2. Transformation: Data must be transformed during ingestion (raw data into clean .csv format).
3. Error Handling: When transformation fails, the records need to be stored in a specific location for human review.
4. Backup of Raw Data: The raw data before transformation also needs to be stored for later use.
Let's evaluate the options based on these factors:
A) Use Amazon Data Firehose with Amazon S3 as the destination. Configure Firehose to invoke an AWS Lambda function for data transformation. Enable source record backup on Firehose:
- Selected Option: Amazon Kinesis Data Firehose is designed to easily handle real-time data ingestion. By configuring it to invoke an AWS Lambda function, the data can be transformed during ingestion. Enabling source record backup ensures that raw data is stored in S3. Additionally, Firehose supports the error prefix configuration, which allows us to direct transformation failures to a different S3 location for human review. This approach requires the least effort, as Firehose is a fully managed service that simplifies ingestion, transformation, and storage.
B) Use Amazon Managed Streaming for Apache Kafka. Set up workers in Amazon Elastic Container Service (Amazon ECS) to move data from Kafka brokers to Amazon S3 while transforming it. Configure workers to store raw and unsuccessfully transformed data in different S3 buckets:
- Rejection Reason: While Kafka can handle high-throughput real-time data streams, it introduces additional complexity. Setting up workers in ECS adds overhead, as the system would require manual management of ECS instances, Kafka brokers, and data pipelines. This makes the solution more com...
Author: Matthew · Last updated Apr 3, 2026
A company wants to use machine learning (ML) to improve its customer churn prediction model. The company stores data in an Amazon Redshift data warehouse.
A data science team wants to use Amazon Redshift machine learning (Amazon Redshift ML) to build a model and run predictions for new data directly within the da...
To use Amazon Redshift ML for building a churn prediction model directly within the data warehouse, the company needs to follow the right set of steps. Let’s evaluate each option carefully:
A) Define the feature variables and target variable for the churn prediction model:
- Selected Option: Defining the feature variables and the target variable is an essential first step when building any machine learning model, including with Amazon Redshift ML. The features are the inputs for the model, and the target variable is the output the model will predict (in this case, customer churn). This step is necessary to ensure the model is trained correctly on the right data.
B) Use the SOL EXPLAIN_MODEL function to run predictions:
- Rejection Reason: The EXPLAIN_MODEL function in Amazon Redshift is used for explaining the model's predictions and understanding how the model works, rather than for running predictions. This function provides insights into feature importance and model performance but is not meant for actual prediction. Therefore, this option doesn't align with the task of running predictions on new data.
C) Write a CREATE MODEL SQL statement to create a model:
- Selected Option: To create a model in Amazon Redshift ML, you need to use the CREATE MODEL SQL statement. This statement defines the model, specifies the algorithm, and indicates the training data to be used. This is the core step to train the model in Redshift ML, so it is required for the workflow.
D) Use Amazon Redshift Spectrum to train the model:
- Rejection Reason: Amazon Redshift Spectrum allows you to query data stored in Amazon S3 using Redshift, but it is not directly used to train models. The model training in ...
Author: Julian · Last updated Apr 3, 2026
A company's machine learning (ML) team needs to build a system that can detect whether people in a collection of images are wearing the company's logo. The company has a set of labeled t...
The task at hand involves detecting whether people in a collection of images are wearing the company's logo. This is a image classification problem, where the goal is to identify specific patterns or objects (the logo) within images. Let’s break down the options and reason which is most suitable:
A) Principal component analysis (PCA):
- Rejection Reason: PCA is a technique primarily used for dimensionality reduction, not image classification. It works by reducing the number of features in a dataset while preserving as much variance as possible. While PCA can be used in preprocessing stages to reduce the size of image data, it is not a classification algorithm. Therefore, it wouldn't be the right choice for detecting the logo in images directly.
B) Recurrent neural network (RNN):
- Rejection Reason: RNNs are designed for processing sequential data, such as time series or text, where the order of the data points matters. While RNNs are powerful for tasks like natural language processing or time-series forecasting, they are not ideal for image classification tasks. Images are best processed using models designed to handle spatial hierarchies in the data, which is why RNNs are not suitable for this problem.
C) 0:...
Author: RadiantJaguar56 · Last updated Apr 3, 2026
A data scientist uses Amazon SageMaker Data Wrangler to obtain a feature summary from a dataset that the data scientist imported from Amazon S3. The data scientist notices that the predictio...
To explain the cause of the prediction power score of 1, let's analyze each option:
Option A: Target leakage occurred in the imported dataset
- Reasoning: Target leakage happens when information from the target variable is used as a feature in the dataset, which can lead to an overly optimistic model. In the case of prediction power, if there's target leakage, the model could "cheat" by having access to future information, leading to high prediction power.
- Why it’s relevant: A prediction power score of 1 could indicate that the feature is perfectly correlated with the target variable due to target leakage.
- Why other options are rejected: While target leakage would lead to perfect prediction power, it specifically affects the data used for training, and there is no direct mention in the scenario that leakage occurred.
Option B: The data scientist did not fine-tune the training and validation split
- Reasoning: Not fine-tuning the split could cause issues with model performance, such as overfitting or underfitting. However, this would not directly explain why the prediction power of a feature would be 1. The prediction power score is more about the relationship between features and target, not the model's validation or training split.
- Why it’s rejected: The lack of fine-tuning the training/validation split wouldn't directly impact the feature's prediction power score, which is the concern here.
Option C: The SageMaker Data Wrangler algorithm that the data scientist used did not find an optimal model fit for each feature to calculate the prediction po...
Author: Sophia Clark · Last updated Apr 3, 2026
A data scientist is conducting exploratory data analysis (EDA) on a dataset that contains information about product suppliers. The dataset records the country where each product supplier is located as a two-letter text code. For example, the code for New Zealand is "NZ."
The data scientist needs to transform the country codes for model training. The data scientist must choose the sol...
To meet the requirements of transforming the country codes for model training, the solution must result in the smallest increase in dimensionality while preserving all the information in the country code. Let’s evaluate each option and see how it aligns with these goals.
Key Requirements:
1. Smallest increase in dimensionality: The solution should not add excessive features or complexity.
2. No information loss: The transformation must retain all the relevant information about the country codes.
Option A) Add a new column of data that includes the full country name
- Explanation: This option would add a new column with the full country names, such as "New Zealand" instead of "NZ."
- Why it's not ideal: Adding the full country name would increase the dimensionality, as each country would still be treated as a string, leading to potential problems with handling string features in machine learning models. Moreover, some models might struggle with string data unless further encoding is applied, and it doesn't reduce dimensionality in any way.
- When it could be used: This option could be useful in certain cases if interpretability or understanding of full country names is crucial. However, it doesn’t meet the requirement of minimizing dimensionality and could lead to unnecessary complexity.
Option B) Encode the country codes into numeric variables by using similarity encoding
- Explanation: Similarity encoding involves encoding categorical variables (like country codes) into numeric representations based on some similarity between the categories.
- Why it's not ideal: While this can work in some cases, similarity encoding might introduce an unintended ordering or distance between countries that doesn't exist in the actual data. For example, encoding countries like "NZ" and "US" with arbitrary numeric values could mislead the model into thinking there’s a relationship between countries based on the numeric values. This could introduce bias.
- When it could be used: This encoding might work for certain tasks where the exact relationships between the countries matter, but it’s not suitable when there's no inherent ordering or similarity between categories, as it could introdu...
Author: Liam · Last updated Apr 3, 2026
A data scientist is building a new model for an ecommerce company. The model will predict how many minutes it will take to deliver a package.
During model training, the data scientist needs to evaluate model perfor...
When evaluating model performance for predicting continuous values, such as the time it will take to deliver a package (a regression task), the following evaluation metrics are commonly used:
Option A: InferenceLatency
- Reasoning: Inference latency refers to the time taken for a model to make a prediction once it has been trained. While this is an important metric in production environments for performance and speed, it is not a measure of how well the model predicts the continuous output (time in minutes). It does not evaluate prediction accuracy or error in predictions.
- Why it’s rejected: This metric is not useful for evaluating the accuracy of the model’s predictions, which is the key focus here.
Option B: Mean Squared Error (MSE)
- Reasoning: MSE is a widely used metric for regression tasks. It measures the average of the squared differences between the predicted and actual values. This is a key metric for understanding how much the predictions deviate from the true values, with larger errors being penalized more heavily.
- Why it’s selected: MSE gives a good indication of how far off predictions are, and it works well when we want to penalize larger errors more. It's one of the standard metrics for regression tasks, such as predicting delivery time.
Option C: Root Mean Squared Error (RMSE)
- Reasoning: RMSE is simply the square root of MSE. While MSE gives a sense of error in squared units, RMSE brings the error back to the original scale of the target variable (minutes, in this case). This makes RMSE easier to interpret because it reflects the actual predicti...
Author: Aria · Last updated Apr 3, 2026
A machine learning (ML) specialist is developing a model for a company. The model will classify and predict sequences of objects that are displayed in a video. The ML specialist decides to use a hybrid architecture that consists of a convolutional neural network (CNN) followed by a classifier three-layer recurrent neural network (RNN).
The company developed a similar model previously but trained the model to classify a different set of objects. The ML specialist wants to save tim...
To accomplish the goal of adapting a previously trained model to a new use case and set of objects with the least effort, the machine learning (ML) specialist should leverage transfer learning. This involves freezing most of the earlier layers and only retraining certain parts of the model to adjust to the new task. Let’s go through the options:
Option A: Reinitialize the weights of the entire CNN. Retrain the CNN on the classification task by using the new set of objects.
- Reasoning: Reinitializing the weights of the entire CNN and retraining it on the new task would undo any benefits of the pre-trained CNN. The CNN layers have learned valuable feature extraction capabilities from the previous task, and retraining the entire CNN would be inefficient. Additionally, the CNN is typically responsible for feature extraction, and this part may still work well for the new objects without needing a full retraining.
- Why it’s rejected: This option is not efficient because it disregards the power of transfer learning, where we want to reuse learned features without starting from scratch.
Option B: Reinitialize the weights of the entire network. Retrain the entire network on the prediction task by using the new set of objects.
- Reasoning: Reinitializing the entire network (both CNN and RNN) and retraining on the new task would require significant computational resources and time. The CNN has already learned useful features in the previous task, and reinitializing the entire network would lead to unnecessary re-learning of those features.
- Why it’s rejected: This is highly inefficient because it ignores the possibility of using transfer learning to save time by reusing already learned features and model components.
Option C: Reinitialize the weights of the entire RNN. Retrain the entire model on the prediction task by using the new set of objects.
- Reasoning: Reinitializing the RNN would erase the knowledge it has gained from the previous model, particularly if the new prediction task is somewhat related to the previous task. This would make the adaptation proces...
Author: Liam · Last updated Apr 3, 2026
A company distributes an online multiple-choice survey to several thousand people. Respondents to the survey can select multiple options for each question.
A machine learning (ML) engineer needs to comprehensively represent every response from all respondents in a dataset. Th...
To comprehensively represent every response from all survey respondents, the ML engineer needs a way to convert the multiple-choice answers (which can include multiple selections per question) into a format suitable for training a logistic regression model. Logistic regression requires the input data to be in numerical format, and the chosen solution should allow for every possible response to be captured while preserving the structure of the survey answers. Let's evaluate each option:
Option A: Perform one-hot encoding on every possible option for each question of the survey
- Reasoning: One-hot encoding is commonly used for categorical data in machine learning. In this case, each possible option from the survey would be represented as a separate binary feature (0 or 1), indicating whether or not a respondent selected that option. Since respondents can select multiple options for each question, each question will have multiple binary columns (one for each possible answer), and each respondent’s answers will be represented as a vector of 0s and 1s.
- Why it’s selected: One-hot encoding is a well-suited method to represent the data, as it maintains the structure of the survey responses (where multiple options can be selected). This approach can be directly used for training a logistic regression model, which can handle multiple binary features effectively. It also ensures that no information is lost in representing the responses.
- Why other options are rejected: This method provides a clear, structured, and efficient way to represent categorical responses in a dataset, which is essential for logistic regression.
Option B: Perform binning on all the answers each respondent selected for each question
- Reasoning: Binning typically involves grouping continuous or ordinal data into predefined ranges or bins. In the case of multiple-choice questions with potentially categorical or discrete options, binning may not be appropriate because it could obscure important distinctions between the choices (e.g., treating two unrelated answers as one). Additionally, binning does not naturally handle the situation where respondents can select multiple answers.
- Why it’s rejected: Binning is more suitable for con...
Author: Kai99 · Last updated Apr 3, 2026
A manufacturing company stores production volume data in a PostgreSQL database.
The company needs an end-to-end solution that will give business analysts the ability to prepare data for processing and to predict future production volume based the previous year's production volume. The soluti...
Key Factors to Consider:
1. Ease of Use: The company requires a solution that does not require coding knowledge. The ease of use is critical here.
2. Data Preparation: Business analysts need to be able to prepare data, which involves cleaning, transforming, and organizing the data for analysis and modeling.
3. Prediction Modeling: The company needs to predict future production volume based on the previous year's data, so the prediction tool should allow easy creation and deployment of machine learning models.
4. Integration with PostgreSQL: The solution needs to work seamlessly with the existing PostgreSQL database, allowing for direct integration without excessive manual effort.
5. Automation and Scalability: Ideally, the solution should handle data at scale and automate many tasks, such as data migration and model training.
Analysis of Options:
Option A:
- AWS Database Migration Service (AWS DMS): AWS DMS can migrate data to Amazon S3 but is more focused on database migration rather than continuous data processing and preparation.
- Amazon EMR: EMR is a big data processing tool, and it is more suited for complex processing and heavy lifting. However, it generally requires coding knowledge, particularly with Spark or Hadoop. It is not the easiest for business analysts without technical skills.
- Amazon SageMaker Studio: While SageMaker Studio is great for building and training machine learning models, it’s more suited for developers and data scientists, not business analysts without coding knowledge.
- Conclusion: This option is too complex for business analysts and requires a higher level of technical expertise.
Option B:
- AWS Glue DataBrew: This is a no-code data preparation tool designed for business analysts. It allows for data cleansing, transformation, and preparation without requiring coding. It directly connects to the PostgreSQL database and can easily extract data for processing.
- Amazon SageMaker Canvas: SageMaker Canvas is a no-code tool t...
Author: NightmareDragon2025 · Last updated Apr 3, 2026
A data scientist needs to create a model for predictive maintenance. The model will be based on historical data to identify rare anomalies in the data.
The historical data is stored in an Amazon S3 bucket. The data scientist needs to use Amazon SageMaker Data Wrangler to ingest the data. The data scientist also needs to perform exploratory data analys...
In this scenario, the data scientist needs to use Amazon SageMaker Data Wrangler for data ingestion and exploratory data analysis (EDA) of the historical data stored in Amazon S3. The goal is to identify rare anomalies while using the least amount of compute resources. Let's go through the options and evaluate them based on the requirements:
Key Factors:
- Minimize compute resources: The solution should use the least amount of compute resources.
- Perform Exploratory Data Analysis (EDA): The data scientist needs to perform EDA, which requires understanding statistical properties of the data.
- Data ingestion: The method of data ingestion must balance between sampling the data and ensuring it is representative for analysis.
Option A) Import the data by using the None option
- Explanation: The "None" option implies that the data would be imported without any sampling, meaning the entire dataset would be ingested for processing.
- Why it may not be ideal: Importing the entire dataset might require significant compute resources, especially if the dataset is large, which contradicts the goal of using the least amount of compute resources. Since the task involves identifying rare anomalies, importing the entire dataset might also lead to unnecessary complexity in the analysis if a smaller subset would suffice.
- When it could be used: This might be suitable for small datasets or when the entire dataset is required for analysis, but it is not the most resource-efficient approach, particularly for large datasets.
Option B) Import the data by using the Stratified option
- Explanation: Stratified sampling involves ensuring that each class or group is proportionally represented in the sample based on a particular feature. This could be useful when the dataset contains multiple groups (e.g., different machine types or failure modes).
- Why it may not be ideal: While stratified sampling ensures proportional representation, it might still result in a relatively large sample, depending on the number of strata and the size of the data. This option could lead to more compute usage if many classes are present, which does not align with the goal of minimizing compute resources.
- When it could be used: This option would be ideal if there is a need to maintain proportional representation of different groups, but it’s more resource-intensive than necessary if the...
Author: Noah · Last updated Apr 3, 2026
An ecommerce company has observed that customers who use the company's website rarely view items that the website recommends to customers. The company wants to recommend items to customers that customers are more likely t...
Key Requirements:
- Short time to implement: The solution should provide an effective recommendation system in the shortest time possible.
- Improving product recommendations: The company wants to recommend products that customers are more likely to want to purchase.
Option A) Host the company's website on Amazon EC2 Accelerated Computing instances to increase the website response speed
- Explanation: Hosting the website on Amazon EC2 Accelerated Computing instances can improve the website's overall response time by leveraging GPUs or other hardware accelerators for computational tasks.
- Why it's not ideal: While it could potentially speed up the response time of the website, it doesn't address the core issue of providing better, more relevant product recommendations. It focuses on infrastructure rather than improving recommendation algorithms.
- When it could be used: This approach could be useful if the website’s performance is slow, but it doesn't directly tackle the need for better product recommendations based on customer preferences.
Option B) Host the company's website on Amazon EC2 GPU-based instances to increase the speed of the website's search tool
- Explanation: Hosting the website on EC2 GPU-based instances might improve the search tool's speed by accelerating certain computational processes.
- Why it's not ideal: Similar to Option A, this improves the speed of search, but it doesn’t help with creating better, more relevant recommendations. The issue isn’t with the speed of search but with the relevance of the recommendations themselves.
- When it could be used: This could be useful if the search tool is computationally intensive, but it doesn't solve the problem of improving personalized product recommendations.
Option C) Integrate Amazon Personalize into the company's website to provide customers with personalized recommendations
- Explanation: Amazon Personalize is a managed service that allows you to easily build and deploy personalized recommend...
Author: Emma Brown · Last updated Apr 3, 2026
A machine learning (ML) engineer is preparing a dataset for a classification model. The ML engineer notices that some continuous numeric features have a significantly greater value than most other features. A business expert explains that the features are independently informative and that the dataset is representative of the target distribution.
After training, t...
Key Requirements:
- Improve model inference accuracy: The goal is to increase the model's inference accuracy, which has not met expectations despite training.
- Handle problematic continuous numeric features: Some continuous features have much greater values than the rest, leading to potential model performance issues.
Option A) Normalize the problematic features
- Explanation: Normalization scales the features to a common range, typically [0, 1], which helps avoid one feature dominating the model due to its larger values. In this case, the problematic features with significantly greater values could be skewing the model's performance.
- Why it's ideal: Normalization (or standardization, depending on the model) helps put all features on a similar scale. This reduces the impact of large values from certain features and ensures that the model is not disproportionately influenced by them, resulting in improved accuracy, especially when using distance-based algorithms or gradient-based models.
- When it could be used: This is the best choice when features have significantly different value ranges and need to be adjusted to ensure the model treats all features equally.
Option B) Bootstrap the problematic features
- Explanation: Bootstrapping involves resampling the dataset with replacement, creating multiple versions of the dataset to build an ensemble model. It’s typically used to improve model robustness, particularly with small datasets or when dealing with variability.
- Why it's not ideal: Bootstrapping doesn’t directly address the issue of feature scaling or the large discrepancy in values between features. While it could help with improving generalization or model performance in some scenarios, it won’t specifically address the feature range problem and will likely not result in the greatest increase in accuracy in this case.
- When it could be used: This is more useful when d...
Author: BlazingPhoenix22 · Last updated Apr 3, 2026
A data scientist uses Amazon SageMaker to perform hyperparameter tuning for a prototype machine leaming (ML) model. The data scientist's domain knowledge suggests that the hyperparameter is highly sensitive to changes.
The optimal value, x, is in the 0.5 < x < 1.0 range. The data scientist's domain knowledge suggests that the optimal value is close to 1.0.
The data scientist needs to find the optimal hyperparameter va...
Key Requirements:
- Optimal hyperparameter value is likely near 1.0 within the range \(0.5 < x < 1.0\), and it is highly sensitive to changes.
- Minimize number of runs and ensure consistent tuning conditions while exploring the optimal hyperparameter value.
Option A) Auto scaling
- Explanation: Auto scaling adjusts the scale automatically based on the range of hyperparameter values during the tuning process.
- Why it’s not ideal: Auto scaling is not a standard scaling technique used for hyperparameter tuning, and it may not focus on the range of values where the optimal hyperparameter is located. It may not be able to provide the level of precision needed in a sensitive region like \(0.5 < x < 1.0\).
- When it could be used: Auto scaling may work in general cases, but it is not optimal for fine-tuning in a specific sensitive range like this.
Option B) Linear scaling
- Explanation: Linear scaling evenly distributes the values across the range from \(0.5\) to \(1.0\), treating all values as equally spaced.
- Why it’s not ideal: While linear scaling gives evenly spaced values, it does not prioritize the finer granularity of values closer to 1.0, which the data scientist’s domain knowledge suggests is the optimal region. This approach might not efficiently explore the sensitive region near 1.0.
- When it could be used: Linear scaling is useful when the hyperparameter sensitivity is relatively uniform across the range, but it's not optimal for a region with highly sensitive changes near the upper end (like \(x \approx 1.0\)).
Option C) Logarithmic scaling
- Explanation: Logarithmic scaling adjusts values based on a logarithmic distribution, emphasizing values toward the lower end of the range and de-emphasizing values toward the higher end.
- Why it’s not ideal: Logarithmic scaling is designed for cases where values span order...
Author: Ethan Smith · Last updated Apr 3, 2026
A data scientist uses Amazon SageMaker Data Wrangler to analyze and visualize data. The data scientist wants to refine a training dataset by selecting predictor variables that are strongly predictive of the target variable. The target variable correlates with other predictor variables.
The data scientist want...
Key Factors to Consider:
1. Predictor Selection: The data scientist wants to select predictor variables that are strongly predictive of the target variable, focusing on understanding how each predictor relates to the target.
2. Correlation Between Features: The data scientist is aware that predictor variables correlate with each other, which can introduce multicollinearity that can complicate model training and performance.
3. Variance Understanding: The data scientist wants to understand the variance in the data along different directions in the feature space, which suggests a need for dimensionality reduction or methods that can highlight relationships between features and target variables.
Analysis of Options:
Option A: Use the SageMaker Data Wrangler multicollinearity measurement features with a variance inflation factor (VIF) score:
- Explanation: The VIF score measures how much the variance of a regression coefficient is inflated due to collinearity with other predictors. High VIF values indicate high collinearity, which can negatively impact model training.
- Pros: This is directly relevant to the data scientist's need to assess correlations between features. It allows the identification of which predictors are collinear and could be redundant in terms of their predictive power. Reducing multicollinearity often improves model performance and interpretability.
- Cons: While VIF can help identify correlated features, it does not directly provide insight into the variance of the data along various directions in the feature space. This method focuses more on identifying collinearity rather than understanding the overall feature space variance.
- Conclusion: VIF is a strong method for addressing multicollinearity, but it doesn’t fully address the variance analysis requirement in the feature space.
Option B: Use the SageMaker Data Wrangler Data Quality and Insights Report quick model visualization to estimate the expected quality of a model that is trained on the data:
- Explanation: This feature provides insights into the overall quality of the data and could estimate the expected quality of a model trained on the dataset. However, this is focused on model performance prediction rather than on understanding relationships and variance within the data itself.
- Pros: Provides a quick way to assess potential model quality and data issues.
- Cons: This option does not specifically help with selecting predictor variables, measuring collinearity, or analyzing variance within the feature space. It is more of a high-level diagnostic tool for model performance, not a tool for data exploration or feature selection.
- Conclusion: This is not the right option for exploring feature rela...
Author: Sara · Last updated Apr 3, 2026
A business to business (B2B) ecommerce company wants to develop a fair and equitable risk mitigation strategy to reject potentially fraudulent transactions. The company wants to reject fraudulent transactions despite the possibility of losing some profitab...
Key Factors to Consider:
1. Operational Effort: The solution should minimize operational effort, meaning it should be easy to implement and manage without requiring extensive customization or manual intervention.
2. Risk Mitigation: The company wants to reject fraudulent transactions while accepting that some legitimate transactions or customers may be lost in the process. This suggests a higher priority on identifying and rejecting fraudulent transactions than minimizing false positives.
3. Fairness and Equity: The solution should be fair and equitable in how transactions are evaluated, ensuring that legitimate transactions are not unfairly rejected while minimizing fraud.
4. Fraud Detection Model Complexity: The solution should offer effective fraud detection without excessive complexity or operational burden.
Analysis of Options:
Option A: Use Amazon SageMaker to approve transactions only for products the company has sold in the past:
- Explanation: This solution restricts transaction approval to only those products the company has sold before, effectively limiting the scope of approved transactions.
- Pros: This method is simple and would reduce the risk of fraud by rejecting new, unfamiliar transactions that might be more prone to fraud.
- Cons: This solution is too simplistic and does not address the real problem of detecting fraudulent transactions across a broader range of potential activities. It would also likely result in rejecting legitimate transactions for new or rare products that the company hasn't sold before, thus causing unnecessary customer dissatisfaction.
- Conclusion: While reducing the scope of transactions may reduce fraud in a limited way, it is not an efficient or comprehensive solution and may result in a poor customer experience.
Option B: Use Amazon SageMaker to train a custom fraud detection model based on customer data:
- Explanation: Amazon SageMaker could be used to train a custom machine learning model to detect fraudulent transactions based on historical customer data, such as transaction behavior, account characteristics, and purchase patterns.
- Pros: A custom model tailored to the company’s specific data could potentially detect fraud with high precision and relevance to the company’s context.
- Cons: Building and training a custom model using SageMaker requires significant time, expertise, and ongoing maintenance. It also involves model tuning, data preparation, and model evaluation, which may increase operational effort, especially for a business that doesn't have the necessary expertise in-house.
- Conclusion: While this option could be effe...
Author: Ishaan · Last updated Apr 3, 2026
A data scientist needs to develop a model to detect fraud. The data scientist has less data for fraudulent transactions than for legitimate transactions.
The data scientist needs to check for bias in the model before finalizing the model. The data scientist needs...
Key Requirements:
- Detect fraud: The model should be capable of detecting fraud, which involves handling imbalanced datasets (more legitimate transactions than fraudulent ones).
- Bias checking: The data scientist needs to check for bias in the model to ensure fairness and accurate results.
- Develop the model quickly: The solution should facilitate rapid model development without too much operational overhead.
- Handle imbalanced data: Fraudulent transactions are less frequent, so techniques to deal with class imbalance (e.g., SMOTE) are necessary.
Option A) Process and reduce bias by using the synthetic minority oversampling technique (SMOTE) in Amazon EMR. Use Amazon SageMaker Studio Classic to develop the model. Use Amazon Augmented AI (Amazon A2I) to check the model for bias before finalizing the model.
- Explanation: SMOTE is applied in Amazon EMR to address class imbalance, and Amazon SageMaker Studio Classic is used for model development. Amazon A2I is used for bias checking.
- Why it’s not ideal: While SMOTE and A2I are effective, Amazon EMR introduces additional operational complexity compared to simpler alternatives. Amazon SageMaker Studio Classic is also an older version, and Amazon SageMaker Clarify would be a more streamlined and effective choice for bias detection, as it is specifically built for that purpose. A2I would be more applicable in human-in-the-loop use cases rather than for model training and bias detection.
- When it could be used: This is not the most efficient or lowest overhead solution, given the tools involved.
Option B) Process and reduce bias by using the synthetic minority oversampling technique (SMOTE) in Amazon EMR. Use Amazon SageMaker Clarify to develop the model. Use Amazon Augmented AI (Amazon A2I) to check the model for bias before finalizing the model.
- Explanation: This option still involves SMOTE in Amazon EMR for handling class imbalance and Amazon SageMaker Clarify for bias detection. A2I is used to check the model for bias.
- Why it’s not ideal: While SageMaker Clarify is the right tool for bias detection, using SMOTE in Amazon EMR creates more operational overhead and complexity than using SageMaker Studio. Additionally, using A2I for bias checking introduces unnecessary complexity, as Clarify is the better tool for this specific task.
- When it could be used: If the data scientist is already familiar with EMR and prefers using it, but thi...
Author: Daniel · Last updated Apr 3, 2026
A company has 2,000 retail stores. The company needs to develop a new model to predict demand based on holidays and weather conditions. The model must predict demand in each geographic area where the retail stores are located.
Before deploying the newly developed model, the company wants to test the model for 2 to 3 days. The model needs to be robust enough to adapt to supply cha...
To meet the company's requirements for predicting demand with minimal operational overhead, while ensuring adaptability to supply chain and retail store needs, the company should focus on leveraging tools that are scalable and require minimal maintenance while allowing for testing and optimization.
Key Requirements:
1. Test for 2-3 days: The company wants to test the model before full deployment, so it needs a mechanism that allows quick testing with real data.
2. Robustness: The model needs to adapt well to supply chain and retail store demands, meaning flexibility and quick adjustments to demand signals, weather, and holidays.
3. Minimal Operational Overhead: The company needs to minimize the operational complexity while ensuring effectiveness.
---
Evaluating Each Option:
A) Develop the model by using the Amazon Forecast Prophet model:
- Explanation: Amazon Forecast's Prophet model is a time-series forecasting model optimized for scenarios like demand prediction, which includes handling seasonality (e.g., holidays) and trends. It supports demand prediction based on historical data and is suited for integrating features like weather and holidays.
- Why selected: This option leverages a pre-built model that is tailored for forecasting time-series data and adapts well to holiday and weather conditions, reducing the overhead of custom development.
- Why rejected: There is no testing or deployment strategy mentioned here, and testing for 2-3 days would require further deployment strategy steps.
B) Develop the model by using the Amazon Forecast holidays featurization and weather index:
- Explanation: This option suggests enhancing the model with holiday and weather features using Amazon Forecast’s built-in capabilities. By adding these features, the model can account for the impact of holidays and weather conditions on demand.
- Why selected: By incorporating these features, the model can more accurately predict demand, which is crucial for meeting the company’s needs. Additionally, using Amazon Forecast reduces operational overhead by leveraging managed services that automatically handle much of the complex data preprocessing.
- Why rejected: While it's important for model development, deployment strategies are not addressed here, and testing for a short duration is not covered directly.
C) Deploy the model by using a canary strategy that uses Amazon SageMaker and AWS Step Functions:
- Explanation:...
Author: Rahul · Last updated Apr 3, 2026
A finance company has collected stock return data for 5,000 publicly traded companies. A financial analyst has a dataset that contains 2,000 attributes for each company. The financial analyst wants to use Amazon SageMaker to identify the top 15 attributes that are most valuab...
To meet the financial analyst's goal of identifying the top 15 attributes that are most valuable for predicting future stock returns, the solution must be efficient, require minimal operational overhead, and accurately highlight the important features. Let’s evaluate the options based on these criteria.
Key Requirements:
1. Identify the top 15 attributes: The analyst needs to identify the most valuable features for prediction.
2. Prediction task: The goal is to predict future stock returns, which involves regression, not classification.
3. Minimal operational overhead: The solution should be straightforward and require minimal configuration, automating as much of the process as possible.
---
Evaluating Each Option:
A) Use the linear learner algorithm in SageMaker to train a linear regression model to predict the stock returns. Identify the most predictive features by ranking absolute coefficient values:
- Explanation: The Linear Learner algorithm is a supervised machine learning algorithm in SageMaker that can be used to train a regression model. After training, the model’s coefficients can be ranked by their absolute values, indicating feature importance.
- Why selected: Linear regression models inherently provide coefficients, which can easily be interpreted to assess feature importance. The method is direct and lightweight, requiring minimal operational overhead.
- Why rejected: While effective for simple relationships, linear regression might not capture complex, nonlinear relationships that could be crucial in stock return predictions. Additionally, if there are many highly correlated features, the model might not perform optimally.
B) Use random forest regression in SageMaker to train a model to predict the stock returns. Identify the most predictive features based on Gini importance scores:
- Explanation: Random Forest Regression is an ensemble method that can handle nonlinear relationships and interactions between features. It provides feature importance scores based on metrics like Gini impurity or mean decrease in accuracy.
- Why rejected: While Random Forest can handle nonlinearities better than linear regression and might provide more accurate results, it is computationally more expensiv...
Author: Carlos Garcia · Last updated Apr 3, 2026
A company is using a machine learning (ML) model to recommend products to customers. An ML specialist wants to analyze the data for the most popular recommendations in four dimensions.
The ML specialist will visualize the first two dimensions as coordinates. The third dimension will be visualized as color. ...
To meet the ML specialist’s requirement of visualizing the data with four dimensions — where two dimensions are represented as coordinates, one as color, and one as size — we need to choose a solution that supports this type of multi-dimensional data visualization. Let's evaluate each option based on these criteria.
Key Requirements:
1. Two dimensions as coordinates: These two dimensions will be visualized on the X and Y axes.
2. Third dimension as color: This will allow for distinguishing between different categories or ranges within that dimension.
3. Fourth dimension as size: The size of the data points will represent the magnitude of this dimension.
---
Evaluating Each Option:
A) Use the Amazon SageMaker Data Wrangler bar chart feature. Use Group By to represent the third and fourth dimensions:
- Explanation: The bar chart feature in Amazon SageMaker Data Wrangler is useful for showing discrete data across categories. "Group By" can help categorize data by the third and fourth dimensions.
- Why rejected: A bar chart is not suitable for visualizing data in a continuous, multi-dimensional space, particularly when needing to represent two dimensions as coordinates and the others as size and color. Bar charts are not appropriate for representing continuous variables or multi-dimensional relationships in this context.
B) Use the Amazon SageMaker Canvas box plot visualization. Use color and fill pattern to represent the third and fourth dimensions:
- Explanation: A box plot is used to visualize the distribution of a dataset and can show statistical summaries like quartiles and outliers. Color and fill patterns can be used to differentiate groups or categories within the data.
- Why rejected: A box plot is not designed for representing data as coordinates (X, Y axes). While it can display distributions and differences between groups, it does not allow the flexibility needed for visu...
Author: Nathan · Last updated Apr 3, 2026
A clothing company is experimenting with different colors and materials for its products. The company stores the entire sales history of all its products in Amazon S3. The company is using custom-built exponential smoothing (ETS) models to forecast demand for its current products. The company needs to ...
To meet the clothing company’s requirements for forecasting demand for a new product variation, we need a solution that efficiently handles time-series forecasting for products with no prior sales history. The company has a custom-built exponential smoothing (ETS) model for its current products but now needs to predict demand for a new product variation, which likely lacks historical data or sales trends. Let’s evaluate each option based on this scenario:
Key Requirements:
- Forecasting demand for a new product variation: The solution must be able to predict future demand for a product that has no prior sales history, making it difficult to rely on traditional time-series methods like ETS, which require historical data.
- Use of existing data: The company has a sales history for its current products in Amazon S3, which can potentially be used to derive patterns for the new product.
---
Evaluating Each Option:
A) Train a custom ETS model:
- Explanation: ETS models, which rely on exponential smoothing, are designed for time-series forecasting. They work well when there is enough historical data to capture seasonal patterns and trends. However, for a new product variation that has no sales history, ETS would struggle to generate meaningful forecasts.
- Why rejected: Since the new product variation does not have historical sales data, an ETS model cannot be effectively trained for this product. ETS models work well for products with a history, but they are not suitable for forecasting demand for entirely new products.
B) Train an Amazon SageMaker DeepAR model:
- Explanation: DeepAR is a probabilistic forecasting model developed by Amazon, and it is specifically designed for time-series forecasting in scenarios where there may be limited or no historical data for individual items. DeepAR can use data from similar products or other relevant features to generate forecasts even for products with no prior ...
Author: Lucas · Last updated Apr 3, 2026
A company makes forecasts each quarter to decide how to optimize operations to meet expected demand. The company uses ML models to make these forecasts.An AI practitioner is writing a report about the trained ML models to provide transparency and explainability to company stak...
To meet the transparency and explainability requirements of the ML models used by the company for forecasts, the AI practitioner needs to provide insights that help stakeholders understand how the models make predictions and how reliable those predictions are. Let's evaluate the options based on the needs for transparency, explainability, and their relevance to the context of the trained ML models.
A) Code for model training
- Purpose: The code for training the model contains the algorithms and methods used to train the model. It provides detailed information about how the model is constructed and trained.
- Reason for rejection: While providing the training code can offer technical transparency, it is generally not directly useful for non-technical stakeholders in understanding how the model works. Stakeholders are likely more interested in understanding the outputs of the model and how those outputs were generated in a comprehensible manner. The training code itself may be too detailed and technical for the report's audience.
- Scenario where it can be used: Providing code for model training is helpful for developers or data scientists who need to replicate or modify the training process, but it doesn't directly enhance the transparency or explainability of the model for business stakeholders.
B) Partial dependence plots (PDPs)
- Purpose: Partial dependence plots (PDPs) show the relationship between a feature (or set of features) and the model's predictions, helping to visualize how changes in specific features affect the model's output. This is a powerful tool for explaining the influence of input features on the model’s decision-making process.
- Reason for selection: PDPs provide a clear, intuitive way for stakeholders to understand how different features impact the model's predictions, offering transparency into how the model interprets data. This is a great way to fulfill the explainability requirement as it helps explain the model behavior in an interpretable way.
- Scenario where it can be used: PDPs are useful when the model is non-linear or ...
Author: James · Last updated May 7, 2026
A law firm wants to build an AI application by using large language models (LLMs). The application will read legal documents and extract key poi...
To determine the best solution for a law firm wanting to build an AI application that reads legal documents and extracts key points, we need to assess the requirements carefully, focusing on the need for reading, understanding, and extracting specific information from legal documents. Let’s evaluate each option:
A) Build an automatic named entity recognition (NER) system
- Operational Overhead: Named Entity Recognition (NER) identifies specific entities (like names of people, organizations, dates, and locations) within text. This could certainly be helpful for extracting legal terms, but it only identifies entities, not necessarily the key points or summaries of the document.
- Suitability: NER is a good tool for extracting certain details from legal documents, but it doesn't directly meet the broader need of extracting key points from the documents. Key points often involve more nuanced understanding, such as summarizing or interpreting relationships and events, which goes beyond simple entity recognition.
- Conclusion: Not ideal. NER focuses on extracting specific entities but doesn’t fully address the need to extract broader key points or summaries from legal documents.
B) Create a recommendation engine
- Operational Overhead: A recommendation engine is designed to suggest items based on past behavior or preferences, such as recommending articles, documents, or case law based on past searches or interactions. While this can be useful for suggesting legal documents or articles, it doesn't help in extracting key points or summarizing the contents of legal documents.
- Suitability: The problem is about summarizing and extracting key points from documents, which is not the focus of a recommendation engine. The engine would not analyze or extract information from documents in the way the law firm needs.
- Conclusion: Not suitable. Recommendation engines don’t fulfill the need for understanding and summarizing the content of legal documents.
C) Develop a summarization chatbot
- Operational Overhead: A summarization chatbot, especially one built using large language models (LLMs), could read and understand...
Author: Krishna · Last updated May 7, 2026
A company wants to classify human genes into 20 categories based on gene characteristics. The company needs an ML algorithm to document how the inner mechanism of th...
To determine which machine learning (ML) algorithm meets the requirements of classifying human genes into 20 categories and documenting how the inner mechanism of the model affects the output, let's analyze each option based on interpretability, explainability, and classification suitability.
A) Decision Trees
- Operational Overhead: Decision trees are relatively simple algorithms that split data based on feature values to classify instances. They naturally provide a clear, visual representation of how the model makes decisions, making them highly interpretable.
- Suitability: Decision trees are great for classification tasks, and they allow you to understand the decision-making process of the model by following the series of splits and rules it applies. This makes it ideal for documenting how the inner mechanism affects the output, which is explicitly required in the question.
- Cost and Time: Decision trees are easy to train and do not require extensive computational resources. They are quick to implement and understand.
- Conclusion: Best option. Decision trees provide transparency into the model's decision-making process, which is essential for documenting the inner workings and how specific gene characteristics influence the classification.
B) Linear Regression
- Operational Overhead: Linear regression is typically used for predicting continuous variables, not for classification tasks. While it's relatively easy to understand (as it involves predicting a target value based on a linear relationship with input features), it is not suitable for multi-class classification, such as classifying genes into 20 categories.
- Suitability: Linear regression would not be ideal for the task of classifying genes into multiple categories, as it doesn't directly handle categorical outputs.
- Conclusion: Not suitable. Linear regression is not designed for multi-class classification and doesn’t match the problem's needs.
C) Logistic Regression
- Operational Overhead: Logistic regression is a classification algorithm commonly used for binary classification. While it can be extended to handle multiple classes (multinomial logistic regression), it is less interpre...
Author: SilverBear · Last updated May 7, 2026
A company has built an image classification model to predict plant diseases from photos of plant leaves. The company wants to evaluate how many images the model classified correctly.Wh...
To evaluate the performance of a model that predicts plant diseases from images of plant leaves, the goal is to determine how many images the model classified correctly. Let’s analyze the options based on this specific task.
Option A: R-squared score
- Efficiency: R-squared is primarily used for regression tasks, where the goal is to measure the proportion of variance explained by the model. It doesn't apply to classification problems, which is the type of task this company is handling (image classification).
- Cost: Using R-squared in this context would lead to inefficiencies because the metric wouldn't provide meaningful insight for classification.
- Time: Calculating R-squared for a classification model would waste time and resources, as it's not the appropriate metric.
- Key Factor: R-squared doesn't provide information about the accuracy of predictions in a classification problem, so it is unsuitable here.
Option B: Accuracy
- Efficiency: Accuracy is the most straightforward and commonly used metric for classification tasks, especially when the goal is to measure how many instances were correctly classified.
- Cost: Accuracy calculation is computationally inexpensive. The company simply needs to compare the predicted labels with the true labels to determine how many were correctly classified.
- Time: The time required to compute accuracy is minimal, making it very efficient for evaluating model performance.
- Key Factor: Since the task is image classification (predicting plant diseases from images), accuracy will directly show the number of correct ...
Author: Nathan · Last updated May 7, 2026
A company is using a pre-trained large language model (LLM) to build a chatbot for product recommendations. The company needs the LLM outputs to be short and written in a specific language.W...
To ensure that the LLM outputs align with the company’s expectations for short responses and a specific language, we need to carefully analyze the given options based on the services, effort, time, cost, and expected outcomes.
A) Adjust the prompt
Adjusting the prompt is a highly effective and low-cost solution. The prompt guides the model on how to behave, what tone to use, and what specific instructions to follow. For example, the company can include clear instructions in the prompt like: "Provide short, concise responses in [specific language]." Adjusting the prompt allows for quick fine-tuning of the model's output without needing additional complex configurations or modifications to the model itself. It’s also a very cost-effective approach as it leverages the existing pre-trained model. This solution minimizes effort and time while achieving the desired output quality.
B) Choose an LLM of a different size
Choosing a different model size (e.g., using a smaller or larger model) could potentially change the behavior of the chatbot, but it does not directly control the output length or language. Larger models might generate more detailed responses, while smaller ones might generate shorter ones, but there's no guarantee that the output will be exactly aligned with the company’s requirements. Additionally, switching to a different model size could involve more cost and effort, including re-training or fine-tuning, which could be time-consuming.
C) Increase the temperature
The temperature controls the randomness of the model’s responses. A higher temperature (e.g., close to 1) ma...
Author: Emma · Last updated May 7, 2026
A company uses Amazon SageMaker for its ML pipeline in a production environment. The company has large input data sizes up to 1 GB and processing times up to 1 hour. The company needs n...
In this case, the company needs to process large input data sizes (up to 1 GB) and achieve near real-time latency, with the input data taking up to 1 hour for processing. To meet these requirements, we need to assess the SageMaker inference options based on processing speed, scalability, cost, and ease of management.
A) Real-time inference
- Benefits: Real-time inference in Amazon SageMaker allows for low-latency responses to incoming requests. It is designed for applications where immediate predictions are required, making it a good fit for scenarios that demand near real-time performance.
- Drawback: While real-time inference is designed for low-latency, serving large input data (like 1 GB) in real-time could lead to performance issues, such as slow response times, depending on the model size and compute resources allocated. Real-time endpoints are also generally optimized for smaller payloads.
- Use Case: This option could be suitable if the company can manage the performance of large payloads and can provision high-performance instances to handle the large input data size quickly. However, for very large data sizes and longer processing times (up to 1 hour), real-time inference may not be ideal unless the infrastructure can handle it efficiently.
B) Serverless inference
- Benefits: Serverless inference in SageMaker automatically scales based on the number of incoming requests and eliminates the need to manage infrastructure. It is cost-effective for infrequent or unpredictable workloads, as you only pay for the compute used.
- Drawback: While serverless inference can scale automatically and is simple to manage, it might not provide the necessary speed and performance for large datasets (1 GB) and longer processing times (up to 1 hour) when near real-time latency is required.
- Use Case: This is best for small, low-volume applications that do not require high compute capacity or fast processing times. It is not optimal for workloads requiring consistent high performance with large data sizes and low latency...
Author: Rahul · Last updated May 7, 2026
A company is using domain-specific models. The company wants to avoid creating new models from the beginning. The company instead wants to adapt pre-trained models to creat...
To meet the company’s goal of adapting pre-trained models for new, related tasks instead of creating new models from scratch, we need to focus on a strategy that allows leveraging existing models efficiently. Here’s a breakdown of the options:
A) Increase the number of epochs
- Reasoning: Increasing the number of epochs involves training the model for more iterations, which can help the model learn better from the data. However, this doesn’t specifically address the need for adapting pre-trained models. It’s a generic technique used in training, not specifically suited for adapting existing models to new tasks.
- Rejected because: This strategy is focused on improving the training process rather than adapting pre-trained models.
B) Use transfer learning
- Reasoning: Transfer learning is the process of taking a pre-trained model (that was trained on one task) and adapting it to a new, but related, task. This strategy allows the company to save time and resources by utilizing the knowledge learned from the original model, which can be fine-tuned for the new task. Transfer learning is specifically designed to address scenarios like this, where a company wants to avoid starting from scratch and instead build upon existing models.
- Accepted because: Transfer learning is the ideal strategy to adapt pre-trained models for new, rel...
Author: IceDragon2023 · Last updated May 7, 2026
A company is building a solution to generate images for protective eyewear. The solution must have high accuracy and must minimize the risk of inc...
To generate accurate images for protective eyewear, the company must ensure high-quality annotations and minimize the risk of errors. This requires leveraging tools that can either improve annotation accuracy or provide robust image generation and recognition capabilities.
Option Analysis:
A) Human-in-the-loop validation by using Amazon SageMaker Ground Truth Plus
- Pros: Amazon SageMaker Ground Truth Plus allows the company to combine machine learning with human validation to create high-quality labeled data for training models. This human-in-the-loop approach reduces the risk of incorrect annotations and ensures high accuracy in labeling, which is critical when generating images for protective eyewear. Human reviewers can catch errors that automated systems might miss, improving the overall reliability of the dataset.
- Cons: Using Ground Truth Plus requires additional effort to set up and manage the human review process. This introduces a time and cost overhead due to the need for human annotators. However, the quality of the annotations will significantly improve, making it suitable for use cases where accuracy is paramount.
- When to use: This is ideal when the accuracy of annotations is critical, and the company needs a reliable solution for creating high-quality datasets, especially in cases where automated annotations might introduce errors.
B) Data augmentation by using an Amazon Bedrock knowledge base
- Pros: Data augmentation can enhance the variety and volume of training data by applying transformations to existing images. It can be helpful in creating a more diverse dataset for training machine learning models.
- Cons: While data augmentation improves the training data, it doesn’t directly address the requirement for minimizing annotation errors. It does not generate images from scratch, nor does it validate annotations for accuracy.
- When to use: Data augmentation is useful when there’s a need to improve the diversity of the dataset, but it is not ideal for minimizing annotat...
Author: Sofia2021 · Last updated May 7, 2026
A company wants to create a chatbot by using a foundation model (FM) on Amazon Bedrock. The FM needs to access encrypted data that is stored in an Amazon S3 bucket. The data is encrypted with Amazon S3 managed keys (SSE-S3).The FM encount...
Let's evaluate each option based on the requirement of accessing encrypted data stored in an S3 bucket, the FM (foundation model) running on Amazon Bedrock, and the SSE-S3 encryption applied to the data.
Option A: Ensure that the role that Amazon Bedrock assumes has permission to decrypt data with the correct encryption key
- Description: This option ensures that the role assumed by Amazon Bedrock has the proper permissions to decrypt the data in the S3 bucket using the correct Amazon S3 encryption key (SSE-S3 in this case).
- Key factors:
- Encryption management: SSE-S3 encryption uses a default encryption key that Amazon manages. The FM needs permission to access and decrypt this data.
- Effort & time: Setting the proper IAM permissions is straightforward and can be done by modifying the role's policy to include permissions for `s3:GetObject` and `s3:Decrypt`.
- Cost: No additional costs beyond the typical S3 usage and Bedrock service charges.
- Why it's suitable: This directly addresses the problem — the FM cannot access encrypted data without the right permissions to decrypt it. The solution requires minimal effort to configure IAM roles, and it meets the requirement without overcomplicating the system.
Option B: Set the access permissions for the S3 buckets to allow public access to enable access over the internet
- Description: This option suggests making the S3 bucket publicly accessible to allow anyone, including Amazon Bedrock, to access the data over the internet.
- Key factors:
- Security risk: Allowing public access to sensitive data is a major security risk. It exposes the data to unauthorized access, which is a serious issue, especially if the data is sensitive.
- Effort & time: While this might seem easy to implement, it introduces unnecessary risk and goes against best practices for data security.
- Cost: No additional cost, but the potential for data leakage or misuse can lead to severe reputational and financial consequences.
- Why it's not suitable: Public access to the S3 bucket would violate security best practices. Encrypting data implies that you want to protect it, and public access would undermine this security. This is not a recommended solution.
Option C: Use prompt engineering techniques to tell the...
Author: Leo · Last updated May 7, 2026
A company wants to use language models to create an application for inference on edge devices. The inference must have the lowest lat...
In this case, the goal is to achieve the lowest latency possible for inference on edge devices, considering factors such as services, effort, time, cost, and other key aspects.
A) Deploy optimized small language models (SLMs) on edge devices
Deploying optimized small language models (SLMs) directly on the edge devices is a viable option for low-latency inference. Small models typically require less computational power and memory, making them suitable for edge devices with limited resources. These models can run locally, meaning no communication with a remote server is necessary, resulting in minimal latency. Additionally, this option is cost-effective since it avoids the need for constant network communication.
- Pros:
- Very low latency, as inference happens directly on the edge device.
- Lower resource usage (less memory and processing power required).
- No dependency on external network connectivity.
- Cons:
- Small models might lack the power and flexibility of larger models, potentially leading to less accurate or more limited capabilities.
- Requires optimization efforts to ensure the models perform well on edge devices.
Best use case: Edge devices with limited computational resources but require fast, efficient, local inference, such as simple language understanding tasks.
B) Deploy optimized large language models (LLMs) on edge devices
Deploying large language models (LLMs) on edge devices would likely introduce significant latency. LLMs require substantial resources (memory, CPU/GPU), which might not be available on edge devices. Even when optimized, LLMs still tend to be too heavy for most edge devices to handle, especially if the goal is minimizing latency.
- Pros:
- Potentially high accuracy and advanced capabilities.
- Cons:
- High resource consumption (CPU, memory, and power), which edge devices might not have.
- Increased latency due to the size and computational requirements of LLMs.
- May not be feasible or cost-effective for edge deployment.
Best use case: Scenarios where edge devices have substantial computational power and are capable of handling large models, such as specialized high-performance edge devices.
C) Incorporate a...
Author: Olivia Johnson · Last updated May 7, 2026
A company wants to build an ML model by using Amazon SageMaker. The company needs to share and manage variables for model development across mu...
To determine which Amazon SageMaker feature best meets the company's needs for sharing and managing variables (or features) across multiple teams for model development, let's analyze each option:
A) Amazon SageMaker Feature Store
- Operational Overhead: Amazon SageMaker Feature Store is a fully managed, centralized repository to store, update, and retrieve features for machine learning models. It enables sharing and versioning of features across teams and allows the use of consistent data for training and inference. It also supports both real-time and batch access to feature data.
- Suitability: Since the company wants to share and manage variables (features) across multiple teams, SageMaker Feature Store is the ideal choice. It is designed specifically to centralize and manage features, making them easily accessible for various teams involved in model development.
- Cost and Time: The service is fully managed, which reduces operational overhead. While there may be some costs involved in storing and retrieving data, these are typically outweighed by the benefits of centralized feature management.
- Conclusion: Best option. It is specifically designed for managing and sharing features (variables) for machine learning models across teams.
B) Amazon SageMaker Data Wrangler
- Operational Overhead: Amazon SageMaker Data Wrangler simplifies data preparation by providing a graphical interface for cleaning, transforming, and analyzing data before training models. While it helps with data preparation, its primary focus is not on managing features across multiple teams.
- Suitability: Data Wrangler is useful for preprocessing data for machine learning, but it does not offer centralized storage or version control of features like Feature Store does. It’s more about preparing data for a specific model rather than managing features across teams over time.
- Conclusion: Not ideal. While it helps with ...
Author: Noah · Last updated May 7, 2026
A company wants to use generative AI to increase developer productivity and software development. The company wants to use Amazon Q Developer.What can ...
The company wants to use generative AI to increase developer productivity and software development. The key requirement is to leverage Amazon Q Developer, which is a generative AI tool specifically designed to assist developers. Let’s go through the options one by one and assess which one aligns best with this need.
Option A: Create software snippets, reference tracking, and open source license tracking.
- Purpose: This option seems focused on streamlining the development process by automating or simplifying certain tasks like creating software snippets, tracking references, and managing open-source licenses. While these features can be useful for increasing productivity, they don't fully leverage the generative AI capabilities Amazon Q Developer is known for.
- Reason for rejection: This option does not directly address the goal of using generative AI to enhance developer productivity by enabling more intelligent or efficient coding workflows (e.g., code generation, optimization, etc.). It is more focused on auxiliary tasks like tracking and managing references.
Option B: Run an application without provisioning or managing servers.
- Purpose: This is the core functionality of AWS Lambda—running applications without the need for server management. While this can help scale applications and simplify infrastructure management, it doesn't address generative AI in the context of software development, which is the focus here.
- Reason for rejection: This is more related to infrastructure management rather than using AI to assist with coding or software development directly. The question is about increasing developer productivity with generative AI, not about serverless computing.
Option C: Enable voice commands for coding and providing natural language search.
- Purpose: This option directly aligns with the c...
Author: Aarav · Last updated May 7, 2026
A financial institution is using Amazon Bedrock to develop an AI application. The application is hosted in a VPC. To meet regulatory compliance standards, the VPC is not allowed access t...
In this scenario, the financial institution is using Amazon Bedrock for an AI application hosted in a VPC, and to meet regulatory compliance standards, the VPC must not have access to any internet traffic. Therefore, the chosen solution should prevent any internet traffic from reaching the VPC, while still allowing the application to operate effectively.
Option Analysis:
A) AWS PrivateLink:
- Description: AWS PrivateLink provides private connectivity between VPCs and AWS services or third-party services without exposing the traffic to the public internet. It allows you to securely access services over private IPs within the VPC.
- Reasoning: AWS PrivateLink is ideal in this case because it allows secure and private access to Amazon Bedrock and other AWS services without needing any internet access. This can meet the requirement of not allowing the VPC to access internet traffic while still being able to interact with necessary services like Amazon Bedrock in a secure manner.
- Scenario Usage: This option is highly suitable when you need to connect to AWS services privately from within your VPC without internet exposure, which fits well with the regulatory compliance requirement of no internet access.
B) Amazon Macie:
- Description: Amazon Macie is a data security service that uses machine learning to automatically discover, classify, and protect sensitive data.
- Reasoning: While Amazon Macie is useful for data protection, it doesn’t directly address the need to prevent internet access. Macie is more focused on identifying and protecting sensitive data rather than controlling network access to the internet.
- Scen...
Author: BlazingPhoenix22 · Last updated May 7, 2026
A company wants to develop an educational game where users answer questions such as the following: "A jar contains six red, four green, and three yellow marbles. What is the probability of choosing a green marble...
To determine the best solution, we need to evaluate the operational overhead, efficiency, and appropriateness of each option. Let's break down each choice:
A) Use supervised learning to create a regression model that will predict probability.
- Operational Overhead: Supervised learning requires labeled data, model training, and optimization, which can be time-consuming and computationally expensive.
- Suitability: The problem is a simple calculation that does not need predictive modeling. Supervised learning models like regression are overkill for a task that only requires basic probability computation.
- Conclusion: Not ideal. The complexity and resource requirements are unnecessary for this simple question.
B) Use reinforcement learning to train a model to return the probability.
- Operational Overhead: Reinforcement learning requires extensive training through rewards and punishments based on trial and error. This is very computationally intensive, and it’s not suited for deterministic problems like calculating probabilities.
- Suitability: Reinforcement learning is designed for decision-making tasks where an agent interacts with an environment. It is highly inappropriate for a simple, static question like the one posed.
- Conclusion: Overcomplicated and inefficient for this case.
C) Use code that will calculate probability by using simple rules and computations.
- Operational Overhead: This is by far the simplest and most efficient solution. You simply need to calculate the p...
Author: ElectricLionX · Last updated May 7, 2026
Which metric measures the runtime efficiency of operating AI models?
To determine the metric that measures the runtime efficiency of operating AI models, we need to evaluate how each metric reflects the efficiency of AI model execution, particularly in terms of how fast or resource-efficient the model operates in production or deployment.
A) Customer satisfaction score (CSAT)
CSAT measures the satisfaction of end users or customers, typically via surveys, but it is not directly related to the runtime efficiency of an AI model. While customer satisfaction is crucial for evaluating the user experience, it does not measure how efficiently a model performs during inference or how quickly it provides results.
Rejection Reason: CSAT measures user experience and feedback, not the runtime performance or efficiency of AI models in terms of computational time or resources.
B) Training time for each epoch
Training time for each epoch measures how long the model takes to train over one iteration of the entire dataset. This metric is relevant for evaluating the time efficiency of the training process, but it is not an indicator of the runtime efficiency during inference (i.e., when the model is being used in production to make predictions).
Rejection Reason: While this metric reflects training efficiency, it doesn't measure the efficiency of the model during actual use or inference, which is the focus of the question.
C) Average response time
Average response time refers to how quickly an AI model responds to reques...
Author: Aria · Last updated May 7, 2026
A company is building a contact center application and wants to gain insights from customer conversations. The company wants to analyze and extract key information from...
Let's break down the options based on the requirements: analyzing and extracting key information from the audio of customer calls.
A) Build a conversational chatbot by using Amazon Lex:
Amazon Lex is a service designed for building conversational chatbots. It is effective for creating real-time conversational interfaces like chatbots, but it doesn't directly address the need to analyze and extract key information from recorded customer call audio. Lex focuses on dialogue management, understanding user intent, and responding to queries, but not on transcribing or extracting information from recorded audio. Therefore, it is not suitable for the task of analyzing past customer call recordings.
B) Transcribe call recordings by using Amazon Transcribe:
Amazon Transcribe is a speech-to-text service that can convert audio recordings into text. This is a crucial step in analyzing call recordings, as transcription is the foundation for extracting insights from spoken content. Once the audio is transcribed into text, the company can further process the text to extract key information, which is aligned with the company’s goal of gaining insights from customer conversations. This solution is both cost-effective and time-efficient for the specific task of converting audio into text for further analysis.
C) Extract information from call recordings by using Amazon SageMaker Model Monitor:
Amazon SageMaker Model Monitor is a tool designed to monitor machine learning models in production, primarily...
Author: Olivia · Last updated May 7, 2026
A company has petabytes of unlabeled customer data to use for an advertisement campaign. The company wants to classify its customers into tiers to advertise and promote the company's...
The company has petabytes of unlabeled customer data and wants to classify its customers into tiers for an advertisement campaign. To meet these requirements, let's analyze the options:
Option A: Supervised learning
- Reasoning: Supervised learning requires labeled data, where the model is trained on input-output pairs. However, the company has unlabeled data, which makes supervised learning an inappropriate choice for this scenario. While it is effective for tasks where labels are available, it won't work here because the company doesn't have predefined labels (i.e., customer tiers).
- Effort: High. Gathering and labeling the data would be a significant effort, especially with petabytes of data.
- Time: High. Labeling a large dataset would take considerable time.
- Cost: High. The cost of labeling data and training a supervised model would be significant.
- Why rejected: Supervised learning requires labeled data, which is not available in this case. Therefore, it is not the right approach for this problem.
Option B: Unsupervised learning
- Reasoning: Unsupervised learning is ideal for tasks involving unlabeled data. In this case, the company can use unsupervised learning techniques to identify patterns or clusters within the customer data and group them into tiers based on similarities. This approach can automatically categorize customers into different segments or clusters, making it a natural fit for the given scenario.
- Effort: Medium. The effort is mainly in selecting and applying the right unsupervised learning algorithms (e.g., clustering algorithms like K-means or DBSCAN).
- Time: Medium. The model would be able to identify clusters relatively quickly without the need for labels.
- Cost: Medium. The costs will depend on the scale and complexity of the data, but it is generally lower than supervised learning.
- Why selected: Unsupervised learning is the most appropriate methodology because it can classify customers into different tiers or segments without requiring labeled data. This fits t...
Author: MoonlitPantherX · Last updated May 7, 2026
An AI practitioner wants to use a foundation model (FM) to design a search application. The search application must handle queries that have text and images.Which ty...
To design a search application that can handle both text and image queries, it's essential to select a foundation model (FM) that is capable of understanding and processing both types of inputs effectively. Let’s analyze the options based on the specific requirements of the task.
Option A: Multi-modal embedding model
- Efficiency: A multi-modal embedding model is designed to handle both text and image inputs by learning a shared representation space for both modalities (text and images). This is highly efficient for tasks like search, where the goal is to match queries (which can consist of both text and images) to relevant content in the database.
- Cost: Multi-modal models can be computationally intensive, but they are typically well-suited for handling multi-modal queries in a search application. The cost is justifiable because the model directly addresses the need for understanding and comparing both text and images.
- Time: These models provide a quick and effective way to process queries that involve both text and images. Once trained, the time taken for retrieval and ranking is optimized for multi-modal inputs.
- Key Factor: Multi-modal embedding models are ideal for search applications because they create a joint embedding space for text and images, enabling the model to retrieve and rank results based on both types of inputs efficiently.
Option B: Text embedding model
- Efficiency: While a text embedding model can effectively process and generate embeddings for textual data, it is limited to handling only text. It cannot understand or process image queries, which makes it unsuitable for a search application that requires handling both text and images.
- Cost: A text embedding model is less computationally expensive compared to multi-modal models, but it is not sufficient for this task since it would fail to handle image-based queries.
- Time: It would be faster to deploy a text-only model, but it wouldn't provide any value when handling mixed (text ...
Author: Liam · Last updated May 7, 2026
A company uses a foundation model (FM) from Amazon Bedrock for an AI search tool. The company wants to fine-tune the model to be more accurate by using the ...
Option Analysis:
A) Provide labeled data with the prompt field and the completion field:
- Reasoning: Fine-tuning a foundation model typically involves supervised learning, where you provide the model with examples of prompts and the desired completions (i.e., the correct responses or outputs for each input). This aligns well with the concept of supervised fine-tuning, where labeled data is used to improve the model's accuracy for a specific task (in this case, an AI search tool).
- Use case: This approach is ideal for fine-tuning a foundation model. The prompt field represents the input query, and the completion field represents the model's expected output. This would allow the model to learn from the company's specific data and improve its relevance and accuracy for the company's use case.
- Conclusion: Selected.
B) Prepare the training dataset by creating a .txt file that contains multiple lines in .csv format:
- Reasoning: While preparing the dataset in a specific format (such as .txt or .csv) is crucial for training models, the description here is a bit ambiguous. The format should be structured appropriately to include labeled data, not just a generic .txt or .csv file. The focus should be on the contents of the data (labeled prompt-completion pairs), not the file format itself.
- Use case: This option may be a part of preparing the data, but the format alone doesn’t address the fine-tuning strategy. Fine-tuning is about how the data is presented to the model, not just the file type.
- Conclusion: Rejected.
C) Purchase Provisioned Throughput for Amazon Bedrock:
- Reasoning: Provisioned throughput allows you to allocate a specific level of compute resources for consistent ...
Author: Zara · Last updated May 7, 2026
A company wants to use AI to protect its application from threats. The AI solution needs to check if an IP address is from a sus...
To determine which AI solution best fits the requirement of protecting an application from threats by checking if an IP address is from a suspicious source, we need to analyze each option based on factors like effort, time, cost, and alignment with the specific task of detecting suspicious IP addresses.
1. Build a speech recognition system:
- Relevance: A speech recognition system is designed to convert spoken language into text. This does not relate to identifying suspicious IP addresses or detecting threats.
- Reason for rejection: This solution is not aligned with the task and would not be effective in protecting against threats based on IP addresses.
- Scenario: This option could be used for applications related to voice assistants or transcription but is irrelevant for threat detection.
2. Create a natural language processing (NLP) named entity recognition (NER) system:
- Relevance: NLP with NER focuses on identifying entities (like names, places, organizations) within text. While useful for analyzing text-based data, it doesn’t have the capability to detect suspicious IP addresses or threats.
- Reason for rejection: NER doesn't address the task of IP threat detection.
- Scenario: This could be used in applications like extracting named entities from legal documents or social media posts, but not for network security.
3. Develop an anomaly detection system:
- Relevance: Anomaly detection is a technique that identifies patterns in data that do not conform to expected behavior. It can be used to detect suspicious activi...
Author: Aditya · Last updated May 7, 2026
Which feature of Amazon OpenSearch Service gives companies the ability to build vector database appl...
To build vector database applications using Amazon OpenSearch Service, the critical feature to focus on is the Scalable index management and nearest neighbor search capability (Option C). This feature allows companies to efficiently store and search high-dimensional vector data, which is crucial for applications such as recommendation systems, semantic search, and image or text similarity matching, among others.
Analysis of each option:
1. A) Integration with Amazon S3 for object storage
- While integrating Amazon OpenSearch Service with S3 helps store large volumes of data in the cloud, it doesn’t directly impact the ability to build vector database applications. S3 provides storage but doesn’t offer the specialized index management or nearest neighbor search functionalities needed for vector databases.
2. B) Support for geospatial indexing and queries
- This feature is primarily designed for applications that need to perform spatial queries, such as those dealing with geographical locations. While useful for certain types of searches, geospatial indexing does not directly support building vector databases, as vector databases are more focused on high-dimensional, non-geospatial data for tasks...