Identify end-users and stakeholders
- Who are the key stakeholders and what do they want from the new system?
In this phase, we establish the initial context by identifying all relevant stakeholders and end users. This includes those who will interact directly with the AI assistant, as well as those affected by its outputs or dependent on its performance.
Documentation review
Review existing research, documentation, and reports to identify stakeholders, users, and operational needs relevant to the AI system.
Stakeholder mapping
Identify and categorise individuals or groups who affect or are affected by the AI system.
Understand context and identify requirements (Human, AI)
This phase establishes a foundational understanding of the work environment, human roles, and key challenges the AI assistant must support. Techniques such as Observation & Walk-throughs, Semi-Structured Interviews, and Focus Groups help ensure that the work as currently done is thoroughly understood, including all the factors, uncertainties, trade-offs and constraints that human end users habitually manage on a day-to-day basis. This in-depth understanding helps to ensure the AI assistant will be able to meet the challenge of assisting the human end user in realistic working conditions. Hierarchical Task Analysis enables a first structured allocation of roles and tasks between human and AI assistant, showing how the future human-AI teaming concept could work in an integrated fashion. Additional consideration of legal requirements (e.g. the EU AI Act) and regulatory requirements (e.g. EASA) can further ensure a sound allocation of function between human and AI system elements.
Observation; Walk/Talk-through; Verbal Protocol
Techniques for eliciting expertise to understand what operators do, how they do it, and why.
Semi-structured Interviews
Use guided conversations to explore user experiences while allowing flexibility and depth.
Hierarchical Task Analysis
Using a diagram to visualise how different tasks interlink to achieve a system goal.
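For projects that want to capture the analysis in a machine-readable form before drawing the diagram, the sketch below shows one possible way to record an HTA as a simple tree; the task breakdown and human/AI allocation are hypothetical examples, not a prescribed structure.

```python
# A minimal sketch of a Hierarchical Task Analysis as a tree structure.
# Task names and the human/AI allocations below are hypothetical examples.
from dataclasses import dataclass, field

@dataclass
class Task:
    name: str
    allocated_to: str          # "human", "AI", or "shared"
    subtasks: list["Task"] = field(default_factory=list)

def print_hta(task: Task, indent: int = 0) -> None:
    """Print the task hierarchy with its human/AI allocation."""
    print("  " * indent + f"{task.name} [{task.allocated_to}]")
    for sub in task.subtasks:
        print_hta(sub, indent + 1)

# Hypothetical example: resolving a traffic conflict with an AI assistant.
goal = Task("Resolve traffic conflict", "shared", [
    Task("Detect potential conflict", "AI"),
    Task("Propose resolution options", "AI"),
    Task("Select and approve resolution", "human"),
    Task("Issue clearance", "human"),
    Task("Monitor conformance", "shared"),
])

print_hta(goal)
```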
Focus Groups
Gather collective insights through structured group discussion.
Legal documents
Useful external links to regulations, standards, and other background documents to guide your work.
EASA Guidance
Read about the concept of Human-AI Teaming (HAT) and the main design principles for ensuring safe “Human-AI Interaction”.
Human–AI teaming CONOPS definition
In this phase it is important to understand how the human and AI will work together on specific tasks in a set of representative scenarios, whether routine, maintenance, emergency or a combination of the three. Scenario-Based Design can identify and explore different scenarios, and Ideation Sessions can then consider how human and AI would respond and interact. This can then be made explicit by mapping the different actors, events and actions in a time-based Operations Sequence Diagram. The HAIQU tool is first applied in this phase. HAIQU (Human-AI QUestionnaire) is a tool designed within the context of HAIKU to be used in a collaborative session to elicit and document teaming-specific requirements. It is envisaged that HAIQU will be used in multiple phases. In this phase, it is particularly useful for exploring the areas of Human Centred Design, Roles and Responsibilities, and Teamworking, which helps verify the CONOPS from a Human Factors requirements perspective and avoid costly changes in these areas in later phases.
Scenario-Based Design
To describe existing or imagine new activities produced by interacting with a new tool.
Operation Sequence Diagram (OSD)
Used for multi-person tasks; maps who does what, when, and with which information.
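One lightweight way to capture an OSD before drawing it is as a time-ordered event list; the sketch below uses hypothetical actors, actions and timings.

```python
# A minimal sketch of an Operation Sequence Diagram as a time-ordered
# event list. Actors, actions and timings are hypothetical examples.
from dataclasses import dataclass

@dataclass
class OSDEvent:
    t: float          # time in seconds from scenario start
    actor: str        # e.g. "pilot", "AI assistant"
    action: str
    information: str  # information used or produced

events = [
    OSDEvent(0.0,  "AI assistant", "detect anomaly",         "sensor feed"),
    OSDEvent(2.0,  "AI assistant", "alert crew",             "anomaly summary"),
    OSDEvent(5.0,  "pilot",        "acknowledge alert",      "alert display"),
    OSDEvent(12.0, "pilot",        "request explanation",    "voice command"),
    OSDEvent(13.5, "AI assistant", "explain recommendation", "rationale text"),
]

# Render a simple who-does-what-when listing, sorted by time.
for e in sorted(events, key=lambda e: e.t):
    print(f"{e.t:6.1f}s  {e.actor:<13} {e.action:<25} ({e.information})")
```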
Co-design session(s)
Collaboratively shape initial Human-AI teaming concepts with users and stakeholders.
Human-AI Teaming Questionnaire (HAIQU)
A web app for collaborative expert analysis of AI systems against Human Factors requirements.
Initial design Activity
In this phase a prototype, whether static (e.g. successive screenshots) or dynamic (an interactive interface), is developed and tested with end users using Low-Fidelity Prototyping. To represent the AI part of the interaction, either a ‘Wizard of Oz’ approach is used, in which a human pretends to be the AI (e.g. responding using text messages), or else a programme gives the scripted answers that the ‘real’ AI would generate in the real situation. In some cases, an early prototype of the AI assistant itself may be ready, in which case it can be used (this can also help the ‘training’ of the AI). A key ingredient in human-AI teaming is sense-making: ensuring that the human and AI are ‘on the same page’. If prototyping suggests that the human may need to better understand what the AI is doing and why, this is where Explainability Generation should be applied. The HAIQU tool should be reapplied in this phase, focusing on Sense-Making (which includes displays, interactions and explainability) and Communications (particularly if speech interfaces are to be used). This phase is likely to go through several iterations and should enlist end-user feedback at each iteration.
Low-Fidelity Prototyping
A cost-effective and efficient approach for assessing different design solutions and making informed decisions, at multiple levels of fidelity.
Construal Level Theory (CLT) for XAI Generation
A psychological framework applied to design layered, context-specific explanations for AI systems.
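As an illustration only (the specific fields and selection rule below are assumptions, not part of the CLT method itself), the sketch shows how an abstract ‘why’ layer and a concrete ‘how’ layer of an explanation might be stored together, with context selecting which layer to present.

```python
# Hypothetical sketch of layered, context-specific explanations inspired
# by Construal Level Theory: abstract ("why") vs concrete ("how") layers.
from dataclasses import dataclass

@dataclass
class LayeredExplanation:
    abstract: str   # high construal level: purpose, the "why"
    concrete: str   # low construal level: mechanism, the "how"

    def render(self, time_pressure: bool) -> str:
        # Assumption for this sketch: under time pressure a short abstract
        # explanation is shown; otherwise the detailed concrete layer.
        return self.abstract if time_pressure else self.concrete

explanation = LayeredExplanation(
    abstract="Reroute suggested to avoid a developing weather cell.",
    concrete=("Convective activity forecast at FL350 along the planned "
              "track within 20 min; the alternative track adds 4 min but "
              "keeps 25 NM separation from the cell."),
)

print(explanation.render(time_pressure=True))
print(explanation.render(time_pressure=False))
```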
Wizard of Oz
A method for testing complex systems by simulating functionality with a human behind the scenes, avoiding costly development while gathering realistic feedback.
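A minimal sketch of such a test harness, assuming a simple text channel: the ‘AI’ side is either a human wizard typing replies behind the scenes or a script of canned answers. All prompts, keywords and responses here are hypothetical.

```python
# Wizard of Oz sketch: the "AI" is either a live human confederate or a
# script of canned answers keyed to expected user inputs (hypothetical).

SCRIPTED_RESPONSES = {
    "status": "All systems nominal. No conflicts predicted in the next 10 minutes.",
    "explain": "Recommendation based on predicted loss of separation at waypoint ALPHA.",
}
DEFAULT_RESPONSE = "I am not able to help with that yet."

def wizard_reply(user_input: str, live_wizard: bool = False) -> str:
    """Return the simulated AI response for one user turn."""
    if live_wizard:
        # A human confederate types the response behind the scenes.
        return input("[wizard] type response: ")
    # Otherwise, fall back to scripted answers matched on keywords.
    for keyword, response in SCRIPTED_RESPONSES.items():
        if keyword in user_input.lower():
            return response
    return DEFAULT_RESPONSE

if __name__ == "__main__":
    print("Type 'quit' to end the session.")
    while (turn := input("participant> ")) != "quit":
        print("assistant>", wizard_reply(turn))
```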
Human-AI Teaming Questionnaire (HAIQU)
2nd application. A web app for collaborative expert analysis of AI systems against Human Factors requirements.
Identify Risks and System-Level Issues
In this phase, risks related to the use of the AI-based assistant are identified. Techniques such as Human HAZOP, SHELL and Expert Walk-through can be applied to low and higher-TRL projects, whereas techniques such as STPA (based on the STAMP accident model) are intended for later-TRL projects (TRL 6+). These techniques all aim to identify and mitigate vulnerabilities in the human-AI teaming operation; as such, normally only one technique needs to be applied, though applying more than one may flag differently nuanced risks. Legal Case Methodology, usually applied to later-TRL projects, considers legal aspects. The HAIQU tool can be applied again with a focus on Errors and Resilience, and projects progressing towards TRL 5 and beyond may wish to revisit the EASA guidance, specifically the material relating to human-AI teaming, explainability and ethics.
Human Hazard and Operability Study (HAZOP)
Structured workshop-based technique to anticipate deviations in human-AI interaction through systematic application of guidewords across operational sequences.
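To illustrate the systematic guideword application, the sketch below generates worksheet rows by crossing a guideword set with an operational sequence; both are hypothetical examples to be tailored to the project, and the deviations, causes, consequences and mitigations would be filled in during the workshop.

```python
# A minimal sketch of a Human HAZOP worksheet generator: each guideword
# is applied to each step of an operational sequence to prompt the
# workshop for possible deviations. Guidewords and steps are examples.

GUIDEWORDS = ["no/not", "more", "less", "as well as", "other than", "early", "late"]

operational_steps = [
    "AI presents conflict resolution advisory",
    "Controller reviews advisory",
    "Controller issues clearance",
]

# Emit one worksheet row per (step, guideword) pairing; the workshop
# then records deviation, causes, consequences and mitigations per row.
for step in operational_steps:
    for guideword in GUIDEWORDS:
        print(f"STEP: {step:<45} GUIDEWORD: {guideword}")
```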
SHELL Model
Shows the key human factors components of socio-technical systems and supports problem analysis.
Expert Walkthroughs
Assess concept robustness by gathering expert feedback on system-level functionality.
System-Theoretic Accident Model & Processes (STAMP)
A systemic accident analysis method focusing on control processes and dysfunctional interactions within socio-technical systems.
Legal Case Methodology
Identifying, analysing, and mitigating legal risks, particularly liability issues, arising from AI and advanced automation in safety-critical domains.
Human-AI Teaming Questionnaire (HAIQU)
3rd application. A web app for collaborative expert analysis of AI systems against Human Factors requirements.
EASA Guidance
2nd application. Read about the concept of Human-AI Teaming (HAT) and the main design principles for ensuring safe “Human-AI Interaction”.
Validate and iterate higher fidelity designs
In this phase a dynamically interactive system is available and is tested with licensed end users in a high-fidelity simulation across one or more scenarios. Performance is measured, including overall system performance as well as the performance of the constituent components (human and AI). Since the AI assistant is there to support the human end user, metrics such as workload and situation awareness may be assessed, alongside canvassing simulation participants’ views on the degree of support afforded by the AI assistant, via Qualitative Debriefings and post-simulation questionnaires such as the System Usability Scale. In some cases, more advanced ‘Neuro-ID’ psycho-physiological measures (e.g. heart rate, galvanic skin response, EEG) may be used to infer impacts on the human user. Eye Tracking may also be used to determine effects on pilot or air traffic controller visual patterns and sense-making of the scenario, or to better track the dynamic interaction between human and AI. System Logs can often help in understanding the detail of such interactions. Following such simulations (often more than one is run, to allow at least one design iteration), the HAIQU tool can be run again for the previous six areas, updating earlier responses where new insights or information have arisen. As mentioned under Phase 4, towards the end of Phase 5 hazard analyses should be repeated to see if the mitigations identified in Phase 4 worked, and whether any new hazards have been discovered.
Build robust prototype
Translate your validated design concepts into working code to create a functional and testable prototype.
Real-time simulations
To test the future product and simulate the environment under assessment in real time.
Eye tracking analysis technique
Record and analyse visual attention patterns to understand how users scan displays and make sense of the scenario.
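As an illustration, dwell time per area of interest (AOI) is a common summary of such data; the sketch below assumes a hypothetical list of (AOI, duration) fixation records rather than any particular tracker’s export format.

```python
# A minimal sketch of eye-tracking analysis: summarise dwell time per
# area of interest (AOI) from fixation records. AOI names and the data
# format are hypothetical examples.
from collections import defaultdict

# (AOI, fixation duration in ms) records from one simulation run.
fixations = [
    ("primary_display", 420), ("ai_advisory_panel", 310),
    ("primary_display", 515), ("out_the_window", 230),
    ("ai_advisory_panel", 640), ("primary_display", 380),
]

dwell = defaultdict(int)
for aoi, duration_ms in fixations:
    dwell[aoi] += duration_ms

total = sum(dwell.values())
for aoi, ms in sorted(dwell.items(), key=lambda kv: -kv[1]):
    print(f"{aoi:<20} {ms:5d} ms  ({100 * ms / total:4.1f}% of dwell time)")
```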
Neuro-ID
Psycho-physiological measures (e.g. heart rate, galvanic skin response, EEG) used to infer workload and other impacts on the human user.
Observation; Walk/Talk-through; Verbal Protocol
2nd application. Techniques for eliciting expertise to understand what operators do, how they do it, and why.
System Log Analysis
Analyse recorded system data to understand user actions, AI responses, and interaction patterns over time.
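One concrete example of such analysis is measuring how long the human takes to respond to each AI advisory; the sketch below assumes a hypothetical log format of timestamp, actor and event.

```python
# A minimal sketch of system log analysis: measure human response
# latency to AI advisories. The log format is a hypothetical example.
from datetime import datetime

log_lines = [
    "2024-05-01T10:00:02 AI    advisory_issued",
    "2024-05-01T10:00:09 HUMAN advisory_acknowledged",
    "2024-05-01T10:04:30 AI    advisory_issued",
    "2024-05-01T10:04:51 HUMAN advisory_acknowledged",
]

def parse(line: str):
    timestamp, actor, event = line.split()
    return datetime.fromisoformat(timestamp), actor, event

latencies = []
pending = None
for timestamp, actor, event in map(parse, log_lines):
    if event == "advisory_issued":
        pending = timestamp
    elif event == "advisory_acknowledged" and pending is not None:
        latencies.append((timestamp - pending).total_seconds())
        pending = None

print(f"mean response latency: {sum(latencies) / len(latencies):.1f} s")
```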
Qualitative Debriefings
Gather in-depth feedback after users interact with a prototype or system.
Post-exercise questionnaire
Gather structured user feedback after system use to assess usability, workload, trust, and more.
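For the System Usability Scale mentioned above, scoring follows a fixed rule: odd-numbered items contribute (response - 1), even-numbered items contribute (5 - response), and the sum is multiplied by 2.5 to give a 0-100 score. A minimal sketch with hypothetical responses:

```python
# System Usability Scale (SUS) scoring for one participant's ten
# responses on a 1-5 scale, using the standard SUS scoring rule.

def sus_score(responses: list[int]) -> float:
    """Score one participant's ten 1-5 SUS responses."""
    assert len(responses) == 10 and all(1 <= r <= 5 for r in responses)
    total = sum(
        (r - 1) if i % 2 == 0 else (5 - r)  # items 1,3,5,... are at index 0,2,4,...
        for i, r in enumerate(responses)
    )
    return total * 2.5

# Hypothetical responses from one simulation participant.
print(sus_score([4, 2, 5, 1, 4, 2, 4, 2, 5, 1]))  # -> 85.0
```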
Human-AI Teaming Questionnaire (HAIQU)
4th applicationA web app for collaborative expert analysis of AI systems against Human Factors requirements.
HAZARD Analysis
Repeat steps from previous phases to see if the potential hazards persist.
Deployment and Continuous Improvement
This phase consists of preparation for the transition to deployment into the intended operational, organisational and social environment in which the AI-based assistant will be used. The HAIQU tool contains two areas relevant to Phase 7, namely Competencies and Training, and Organisational Readiness. Additionally, two questionnaires on Societal Acceptance and Safety Culture help determine the readiness of the user population to accept the new technology, and any concerns over impacts on individual or organisational safety culture. When the tool is first deployed and people are being trained to use it and starting to work with it, it can be useful to apply the User Journey Map technique to a representative sample of end users. This technique picks up annoyances (called ‘pain points’), whether related to the tool itself, the way it is being released and deployed into the system, or a lack of smooth integration into legacy systems. Such problems can detract from the tool’s effective usage, such that its full benefits are never realised, no matter how well it was designed. Lastly, Error Reporting on the use of the tool is critical in the early deployment phase, to capture errors (human or AI), misunderstandings, and other problems. If such problems are not detected quickly and corrected, the AI assistant will rapidly fall into disuse. This phase does not end until decommissioning, and so is a continuous learning and adaptation phase, and hopefully one in which the AI assistant becomes a valued part of the aviation system in which it serves. This phase implies successive AI Maintenance activities (such as continuous monitoring and benchmarking, and model retraining whenever necessary).
Human-AI Teaming Questionnaire (HAIQU)
5th applicationA web app for collaborative expert analysis of AI systems against Human Factors requirements.
Societal Acceptance Questionnaire
A questionnaire to measure perceptions of and attitudes toward the use of an AI-based system.
Safety Culture Debrief
A short questionnaire for aviation workers (pilots and ATCOs) to elicit perceptions and judgements about an Intelligent Assistant's (IA's) potential impact on Safety Culture.
User Journey Map
Map real user experiences to uncover gaps, pain points, and improvement opportunities.
AI Maintenance activities
- Monitor and benchmark
- Model retraining
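As an illustration of continuous monitoring and benchmarking, the sketch below flags possible performance drift when rolling accuracy falls below a deployment benchmark; the thresholds, window size and outcome stream are hypothetical.

```python
# A minimal sketch of continuous monitoring for AI maintenance: compare
# a rolling accuracy window against a benchmark and flag when model
# retraining may be needed. All thresholds are hypothetical.
from collections import deque

BENCHMARK_ACCURACY = 0.92   # accuracy accepted at deployment
ALERT_MARGIN = 0.05         # tolerated degradation before flagging
WINDOW = 100                # number of recent predictions to average

recent = deque(maxlen=WINDOW)

def drift_detected(correct: bool) -> bool:
    """Record one prediction outcome; return True once the rolling
    accuracy drops below the benchmark by more than the margin."""
    recent.append(1.0 if correct else 0.0)
    if len(recent) < WINDOW:
        return False
    return sum(recent) / WINDOW < BENCHMARK_ACCURACY - ALERT_MARGIN

# Hypothetical stream of outcomes: performance degrades over time.
for i, outcome in enumerate([True] * 80 + [False] * 40):
    if drift_detected(outcome):
        print(f"drift alert at prediction {i}: consider model retraining")
        break
```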
Design Iteration
Redesign the mockup and review the concept based on the first simulations. Go back to previous steps in the path if necessary.