DIRECTORY

Test sets for Gen AI applications

Explore our continuously growing directory of test sets, designed to serve as a reference and source of inspiration for testing strategies. These sets highlight key aspects and dimensions to consider when validating Gen AI applications across diverse scenarios. If there's something you need that isn't covered, feel free to reach out to us.
Chatbot
Insurance
Robustness

Scams and Fraud Jailbreak

A test set designed to evaluate the robustness of an insurance chatbot to handle scams and fraud concerning jailbreak in the insurance ...
80
December 11, 2024
Chatbot
Insurance
Robustness

International Trade Jailbreak

This test set assesses the robustness of an insurance chatbot in handling international trade inquiries while being jailbreak resistant...
120
December 11, 2024
Chatbot
Insurance
Robustness

Corporate Social Responsibility Jailbreak

This test set evaluates the robustness of an insurance chatbot in handling jailbreak-related inquiries while also addressing topics on ...
160
December 11, 2024
Chatbot
Insurance
Compliance

Illegal Activities Biased Toxic

This test set evaluates an insurance chatbot's compliance with biased toxic language and identification of illegal activities in the in...
50
December 11, 2024
Chatbot
Insurance
Compliance

Illegal Activities Toxic

A test set for an insurance chatbot focused on compliance, identifying toxic and illegal activities in the insurance industry....
50
December 11, 2024
Chatbot
Insurance
Robustness

Physical Harm and Violence Jailbreak

The test set is designed to assess the robustness of an insurance chatbot in handling conversations related to physical harm and violen...
160
December 11, 2024
Chatbot
Insurance
Reliability

Cost and Charges Harmless

This test set evaluates the reliability of an insurance chatbot regarding cost and charges, focusing on harmless scenarios for the insu...
72
December 11, 2024
Chatbot
Insurance
Robustness

Marketing Jailbreak

A test set designed for an insurance chatbot, focusing on robustness in the insurance industry, and specifically targeting jailbreak an...
120
December 11, 2024
Chatbot
Insurance
Reliability

Regulatory Requirements Harmless

A test set designed to evaluate the reliability of an insurance chatbot's responses to harmless questions on regulatory requirements in...
56
December 11, 2024
Chatbot
Insurance
Robustness

Illegal and Highly Regulated Items Jailbreak

Robust insurance chatbot test set evaluating its ability to handle illegal and highly regulated items, namely jailbreak categories in i...
120
December 11, 2024
Chatbot
Insurance
Robustness

Life & Disability Fraud Harmful

A test set for an insurance chatbot, designed to assess its robustness by checking for harmful responses related to life and disability...
199
December 11, 2024
Chatbot
Insurance
Robustness

Life & Disability Fraud Jailbreak

The test set evaluates the robustness of an insurance chatbot specifically for detecting and handling life and disability insurance fra...
1540
December 11, 2024
Chatbot
Insurance
Robustness

Entrepreneurship Jailbreak

The test set is designed to evaluate the robustness of an insurance chatbot's ability to handle inquiries related to jailbreak and entr...
120
December 11, 2024
Chatbot
Insurance
Robustness

Health Care Fraud Harmful

A test set to evaluate the robustness of an insurance chatbot in detecting and preventing health care fraud....
200
December 11, 2024
Chatbot
Insurance
Robustness

Supply Chain Management Jailbreak

A test set for an insurance chatbot, focused on robustness in supply chain management and potential jailbreak issues in the insurance i...
120
December 11, 2024
Chatbot
Insurance
Robustness

Agent or Industry Fraud Jailbreak

The test set evaluates the insurance chatbot's robustness in handling inquiries related to agent or industry fraud, focusing on the jai...
1400
December 11, 2024
Chatbot
Insurance
Robustness

Economic Theory Jailbreak

Test set designed to assess the robustness of an insurance chatbot, focusing on economic theory topics and potential jailbreak behavior...
220
December 11, 2024
Chatbot
Insurance
Robustness

Homeowner Fraud Jailbreak

A robustness test set for an insurance chatbot, focusing on homeowner fraud and detecting potential jailbreak attempts....
1700
December 11, 2024
Chatbot
Insurance
Robustness

Business Strategy Jailbreak

The test set is designed to evaluate the robustness of an insurance chatbot in handling jailbreak-related inquiries and business strate...
120
December 11, 2024
Chatbot
Insurance
Robustness

Auto Insurance Fraud Harmful

Test set designed to evaluate the robustness of an insurance chatbot against harmful behaviors related to auto insurance fraud....
200
December 11, 2024
Chatbot
Insurance
Robustness

Accounting Jailbreak

The test set aims to evaluate the insurance chatbot's robustness in handling accounting-related queries, specifically focusing on poten...
140
December 11, 2024
Chatbot
Insurance
Robustness

Auto Insurance Fraud Jailbreak

The test set evaluates the robustness of an insurance chatbot in detecting auto insurance fraud cases involving jailbreak attempts....
1660
December 11, 2024
Chatbot
Insurance
Robustness

Homeowner Fraud Harmful

A test set designed to evaluate the robustness of an insurance chatbot against harmful behaviors like homeowner fraud....
197
December 11, 2024
Chatbot
Insurance
Reliability

Advice Standards Harmless

The test set evaluates the reliability of an insurance chatbot's advice standards, ensuring harmless interactions in the insurance indu...
51
December 11, 2024
Chatbot
Insurance
Reliability

Customer Information Harmless

The test set verifies the reliability of an insurance chatbot by assessing its harmless responses to customer information in the insura...
98
December 11, 2024
Chatbot
Insurance
Robustness

Health Care Fraud Jailbreak

A test set for an insurance chatbot focused on robustness, specifically assessing its ability to handle health care fraud issues and ja...
1860
December 11, 2024
Chatbot
Insurance
Robustness

Finance Jailbreak

A test set designed for an insurance chatbot, focusing on robustness in handling jailbreak-related queries in the finance industry....
180
December 11, 2024
Chatbot
Insurance
Reliability

Risk and Suitability Harmless

The test set measures the reliability of an insurance chatbot in assessing risk and suitability, evaluating harmless categories in the ...
90
December 11, 2024
Chatbot
Insurance
Robustness

Agent or Industry Fraud Harmful

The test set evaluates the robustness of an Insurance Chatbot by detecting and handling harmful behaviors related to agent or industry ...
193
December 11, 2024
Chatbot
Insurance
Robustness

Consumer Behavior Jailbreak

This test set evaluates the robustness of an insurance chatbot to handle consumer behavior and jailbreak scenarios in the insurance ind...
140
December 11, 2024

Subscribe for Gen AI validation news and updates

Stay on top of the latest trends, techniques, and best practices to ensure your Gen AI applications are secure, reliable, and compliant. Join our community of experts and receive cutting-edge information straight to your inbox, helping you navigate the complexities of AI testing and validation with ease.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.