Primary behavioural data for frontier AI.
We don't scrape the internet or annotate output. We capture what humans do: complex multi-modal interactions, recorded live, fully instrumented, and structured for frontier AI training.
The OG intelligence source – humans
We record every session live with a named, paid, consenting expert. We can tell you the role, the brief, and the date. No scraped corpora, no orphaned provenance.
Screen, voice, keystroke, gaze.
Most data on the open web is text exhaust, and real expert work isn't. We capture the full signal across cursor paths, hesitations, voiced reasoning, and screen state. That's where tacit knowledge lives.
Built for the labs that ship to billions.
Our schema and review pipelines come out of direct conversation with frontier model teams. Every artifact ships with granular consent, attribution chains, and audit trails. The output is ready for the workflows that train frontier models.
Deep human experience, refined into post-training fuel.
The hard part of training a useful model isn't compute. It's the quality of the human signal underneath. Most teams reach for the same exhausted corpora, then layer annotators on top.
We work the other way. We start with the raw practice of real work, and refine it into structured samples that preserve reasoning, modality, and context.
Same shape as a production pipeline: capture, instrumentation, delivery.
Sessions, with experts, in their actual environment.
Practitioners doing real work, in the tools they already use. Software, peripherals, voice, screen, artefacts. No simulated tasks or synthetic prompts.
Every action timestamped, every modality aligned.
Structured signal from the moment a session starts. Millisecond timing, transcription, decision-point tagging, reasoning capture. Provenance and consent built in.
Schema-conformant, ingestion-ready.
We fit the schema to the partner lab's pipeline, not to a generic format. Sessions arrive ready for direct ingestion, with full provenance, expert attribution, and trajectory data shaped the way post-training actually needs it. No reformatting. No second-pass cleanup.
Ten session formats. Each grounded in a slice of the sixty-two-pattern catalogue.
CAT 01 Expert demonstrating interface use to naive user
CAT 02 Naive user learning from expert
CAT 03 Expert reviews user's unmoderated session
CAT 04 Multi-party discussion
CAT 05 Multi-stage handoff
CAT 06 Solo expert thinking aloud
CAT 07 Collaborative problem-solving
CAT 08 Asymmetric expertise dialogue
CAT 09 AI-moderated session with post-task review
CAT 10 Multilingual / translation sessions
Operated by Askable. Audited to the standards your security review expects.
Askable Labs runs on the same audited production platform as Askable. Controls live in code, not in process. Recruitment, consent, capture, tagging, review, and delivery are system calls, not procedures — no spreadsheet, no shared drive, no manual chain of custody.
Askable has operated since 2017, runs an Integrated Management System (IMS), and holds eight independent certifications — ISO/IEC 27001, 27701, and 42001, SOC 2 Type II, GDPR, CCPA, UK Cyber Essentials, and Wiz Cloud Security Excellence.
All certifications are held by Askable, the parent platform, and apply directly to Askable Labs. SOC 2 report and penetration test summary available under MNDA.
Open the Trust Center
A production platform, not a services team.
Askable has run since 2017 as a SaaS platform for user research, trusted by over 3,000 clients including teams in banking and health insurance. Every step of a session — recruiting a practitioner, capturing their consent, ingesting the session, tagging the fragments, reviewing the output, delivering the batch — is a system call against that audited platform.
In a services model, each of those steps is a person with a laptop. The system is whoever is most careful that day. In our model, the system is the system.
Recruit
Consent
Capture
Review
Deliver
If you're training the next generation of models, train it with human jet fuel.
We work directly with a small number of frontier labs and applied teams. Bespoke capture briefs, schema co-design, exclusive batches.