Which term describes risk where data used to train or validate the model can be manipulated to embed backdoors or biases?

Prepare for the AAISM Domain 2 Test. Engage with multiple choice questions, each offering hints and explanations to boost your understanding. Get ready for success in your exam!

Multiple Choice

Which term describes risk where data used to train or validate the model can be manipulated to embed backdoors or biases?

Explanation:
Data poisoning is the risk where the data used to train or validate a model is tampered with to embed backdoors or biases. By injecting carefully crafted examples into the training set, an attacker can make the model learn hidden behaviors that only activate under specific triggers, effectively creating a backdoor. Poisoned validation data can also skew evaluation results and encourage biased or unsafe model behavior once deployed, because the model has learned from misleading signals. Other terms describe different issues: improper output handling concerns how outputs are managed or sanitized, unbounded consumption relates to resource or data ingestion limits, and system prompt leakage involves hidden instructions or prompts leaking into the model’s behavior.

Data poisoning is the risk where the data used to train or validate a model is tampered with to embed backdoors or biases. By injecting carefully crafted examples into the training set, an attacker can make the model learn hidden behaviors that only activate under specific triggers, effectively creating a backdoor. Poisoned validation data can also skew evaluation results and encourage biased or unsafe model behavior once deployed, because the model has learned from misleading signals. Other terms describe different issues: improper output handling concerns how outputs are managed or sanitized, unbounded consumption relates to resource or data ingestion limits, and system prompt leakage involves hidden instructions or prompts leaking into the model’s behavior.

Subscribe

Get the latest from Examzify

You can unsubscribe at any time. Read our privacy policy