Capstone Project
Calibrating Trust in AI: Designing the TRUST-CT Scale
A human–AI interaction capstone exploring how interface design and a new scale can support healthy skepticism, reduce overreliance, and promote critical thinking with AI-generated text.
Project Overview
As large language models like ChatGPT, Copilot, and Gemini enter everyday workflows, people are increasingly relying on AI-generated text to make academic, professional, and personal decisions. My capstone project asks: what makes people trust AI responses, and when does that trust actually support critical thinking?
I designed TRUST-CT (Trust for Critical Thinking), a user-facing scale that measures why someone trusts an AI output and whether that trust is likely to lead to healthy verification behaviors instead of blind acceptance.
The Problem
Existing trust scales were largely created for traditional automation (like autopilot systems), not for generative AI, which can hallucinate and be confidently wrong. That mismatch creates two failure modes:
- Over-trust / automation bias – users accept AI outputs without checking.
- Algorithm aversion – users see one error and reject AI even when it would help.
Many current measures focus on “increasing trust,” but don’t ask whether that trust is appropriately calibrated or whether the user actually verifies claims.
Research Goals
Main goal: create a brief, validated scale that captures the reasons behind a user’s trust in AI and predicts whether they will double-check the information.
I focused on three questions:
- Which latent factors best explain the trust judgments that lead users to verify (or skip verifying) AI outputs?
- Does TRUST-CT predict appropriate reliance (accepting correct AI, rejecting incorrect AI)?
- Which interface cues (uncertainty, citations, explanations) most improve trust calibration?
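The second goal, appropriate reliance, can be made concrete with a simple behavioral metric: a trial counts as "appropriate" when the user accepts a correct AI answer or rejects an incorrect one. The sketch below is an illustrative scoring function I use to explain the idea, not an implementation from the study; the trial format is an assumption.

```python
# Sketch of an appropriate-reliance metric (illustrative; the trial
# representation is hypothetical, not the study's actual data format).

def appropriate_reliance(trials):
    """trials: list of (ai_correct, user_accepted) boolean pairs.

    A trial is appropriate when the user's accept/reject decision
    matches the AI's actual correctness. Returns the fraction of
    appropriate trials.
    """
    if not trials:
        raise ValueError("need at least one trial")
    hits = sum(1 for ai_correct, accepted in trials if accepted == ai_correct)
    return hits / len(trials)

# Four trials: accept-correct, reject-incorrect (both appropriate),
# accept-incorrect (overreliance), reject-correct (algorithm aversion).
trials = [(True, True), (False, False), (False, True), (True, False)]
print(appropriate_reliance(trials))  # -> 0.5
```

Under this framing, over-trust shows up as accepting incorrect outputs and algorithm aversion as rejecting correct ones, so a single rate captures both failure modes.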
TRUST-CT Scale Concept
TRUST-CT adapts information-literacy frameworks like CRAAP and SIFT to a human–AI trust setting. It operationalizes five user-centered factors:
- Transparency – does the AI reveal limits, uncertainty, and gaps?
- Reliability – does it feel consistent and accurate over time?
- Understandability – can users follow the reasoning behind a response?
- Source Credibility – are sources visible, checkable, and trustworthy?
- Teleology (Purpose) – is the AI’s purpose clear and aligned with user goals?
The scale is intended not only to measure trust but also to nudge users into a brief moment of reflection when they read an AI-generated claim.
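To show how the five factors would translate into scores, here is a minimal scoring sketch: each factor is assumed to have a few 1–5 Likert items, and the subscale score is the item mean. The item keys and item counts are placeholders, not the actual TRUST-CT items.

```python
# Hypothetical TRUST-CT scoring sketch. Item keys (t1, r1, ...) and the
# three-items-per-factor layout are illustrative assumptions only.

FACTORS = {
    "Transparency":       ["t1", "t2", "t3"],
    "Reliability":        ["r1", "r2", "r3"],
    "Understandability":  ["u1", "u2", "u3"],
    "Source Credibility": ["s1", "s2", "s3"],
    "Teleology":          ["p1", "p2", "p3"],
}

def score_trust_ct(responses):
    """responses: dict mapping item key -> 1..5 Likert rating.

    Returns a dict of factor -> mean subscale score.
    """
    return {
        factor: sum(responses[i] for i in items) / len(items)
        for factor, items in FACTORS.items()
    }

responses = {"t1": 4, "t2": 5, "t3": 3, "r1": 4, "r2": 4, "r3": 4,
             "u1": 2, "u2": 3, "u3": 4, "s1": 5, "s2": 4, "s3": 3,
             "p1": 3, "p2": 3, "p3": 4}
print(score_trust_ct(responses))
```

Keeping the five subscales separate (rather than collapsing them into one trust number) preserves the diagnostic value of the scale: a user can score high on Reliability yet low on Source Credibility, which points to a different interface fix.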