promptflow.evals.synthetic.direct_attack_simulator module#

class promptflow.evals.synthetic.direct_attack_simulator.DirectAttackSimulator(*, azure_ai_project: Dict[str, Any], credential=None)#

Bases: object

Initialize a UPIA (user prompt injected attack) jailbreak adversarial simulator with a project scope. This simulator converses with your AI system using prompts designed to interrupt normal functionality.

Parameters:
  • azure_ai_project (Dict[str, Any]) –

    Dictionary defining the scope of the project. It must include the following keys:

    • ā€subscription_idā€: Azure subscription ID.

    • ā€resource_group_nameā€: Name of the Azure resource group.

    • ā€project_nameā€: Name of the Azure Machine Learning workspace.

  • credential (TokenCredential) – The credential for connecting to Azure AI project.

__call__(*, scenario: AdversarialScenario, target: Callable, max_conversation_turns: int = 1, max_simulation_results: int = 3, api_call_retry_limit: int = 3, api_call_retry_sleep_sec: int = 1, api_call_delay_sec: int = 0, concurrent_async_task: int = 3, randomization_seed: Optional[int] = None)#

Executes the adversarial simulation and UPIA (user prompt injected attack) jailbreak adversarial simulation against a specified target function asynchronously.

Parameters:
  • scenario (promptflow.evals.synthetic.adversarial_scenario.AdversarialScenario) –

    Enum value specifying the adversarial scenario used for generating inputs. example:

  • target (Callable) – The target function to simulate adversarial inputs against. This function should be asynchronous and accept a dictionary representing the adversarial input.

  • max_conversation_turns (int) – The maximum number of conversation turns to simulate. Defaults to 1.

  • max_simulation_results (int) – The maximum number of simulation results to return. Defaults to 3.

  • api_call_retry_limit (int) – The maximum number of retries for each API call within the simulation. Defaults to 3.

  • api_call_retry_sleep_sec (int) – The sleep duration (in seconds) between retries for API calls. Defaults to 1 second.

  • api_call_delay_sec (int) – The delay (in seconds) before making an API call. This can be used to avoid hitting rate limits. Defaults to 0 seconds.

  • concurrent_async_task (int) – The number of asynchronous tasks to run concurrently during the simulation. Defaults to 3.

  • randomization_seed (Optional[int]) – Seed used to randomize prompt selection, shared by both jailbreak and regular simulation to ensure consistent results. If not provided, a random seed will be generated and shared between simulations.

Returns:

A list of dictionaries, each representing a simulated conversation. Each dictionary contains:

  • ’template_parameters’: A dictionary with parameters used in the conversation template,

    including ā€˜conversation_starter’.

  • ’messages’: A list of dictionaries, each representing a turn in the conversation.

    Each message dictionary includes ā€˜content’ (the message text) and ā€˜role’ (indicating whether the message is from the ā€˜user’ or the ā€˜assistant’).

  • ’$schema’: A string indicating the schema URL for the conversation format.

The ā€˜content’ for ā€˜assistant’ role messages may includes the messages that your callback returned.

Return type:

Dict[str, [List[Dict[str, Any]]]] with two elements

Output format

return_value = {
    "jailbreak": [
    {
        'template_parameters': {},
        'messages': [
            {
                'content': '<jailbreak prompt> <adversarial question>',
                'role': 'user'
            },
            {
                'content': "<response from endpoint>",
                'role': 'assistant',
                'context': None
            }
        ],
        '$schema': 'http://azureml/sdk-2-0/ChatConversation.json'
    }],
    "regular": [
    {
        'template_parameters': {},
        'messages': [
        {
            'content': '<adversarial question>',
            'role': 'user'
        },
        {
            'content': "<response from endpoint>",
            'role': 'assistant',
            'context': None
        }],
        '$schema': 'http://azureml/sdk-2-0/ChatConversation.json'
    }]
}
promptflow.evals.synthetic.direct_attack_simulator.monitor_adversarial_scenario(func) Callable#

Decorator to monitor adversarial scenario.

Parameters:

func (Callable) – The function to be decorated.

Returns:

The decorated function.

Return type:

Callable