Novel challenges in voice AI, workflow automation, and intelligent systems. Not theoretical research - real problems from production systems serving thousands of users.
These aren't side projects. They're core to what we're building and directly impact the products our customers use.
Building conversational AI agents that handle phone calls naturally. The technical challenges include sub-200ms latency, interruption handling, turn-taking detection, and graceful degradation on poor audio.
General-purpose LLMs are impressive but expensive and slow for high-volume, domain-specific tasks. We build custom models for intent classification, entity extraction, and quality scoring.
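As an illustration of the trade-off, a task like intent classification can often be served by a small supervised model instead of an LLM call. This is a minimal sketch using scikit-learn; the intent labels and training utterances are made up for the example, not our actual models or data:

```python
# Sketch: a small, fast intent classifier as an alternative to a
# general-purpose LLM for high-volume routing. Labels are illustrative.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

training_utterances = [
    ("I want to check my order status", "order_status"),
    ("Where is my package", "order_status"),
    ("Cancel my subscription please", "cancellation"),
    ("I'd like to stop my plan", "cancellation"),
    ("Can I speak to a person", "escalation"),
    ("Get me a human", "escalation"),
]
texts, labels = zip(*training_utterances)

# TF-IDF + logistic regression: milliseconds per prediction on CPU,
# versus hundreds of milliseconds and per-token cost for an LLM call.
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
clf.fit(texts, labels)

print(clf.predict(["where's my parcel"]))
```

A model this small trains in milliseconds and serves thousands of requests per second, which is the point: reserve the LLM for tasks that actually need open-ended generation.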
Moving beyond rule-based automation to systems that can suggest workflow improvements, predict bottlenecks, and handle exceptions intelligently.
Specific initiatives within our research areas. Some are close to production, others are early exploration.
Using historical patterns to identify cases likely to breach deadlines before they do. Early warning means early intervention.
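A minimal sketch of the idea, with invented features and hand-picked weights purely for illustration (a real system would fit these from historical closed cases):

```python
# Illustrative deadline-breach early warning: score open cases so that
# high-risk ones surface before the SLA is missed. Weights are made up.
from dataclasses import dataclass

@dataclass
class Case:
    hours_open: float
    sla_hours: float
    touches: int          # agent interactions logged so far
    reassignments: int    # hand-offs between teams

def breach_risk(case: Case) -> float:
    """Return a 0-1 score; higher means more likely to miss the deadline."""
    time_used = min(case.hours_open / case.sla_hours, 1.0)  # SLA budget consumed
    stalled = 1.0 if case.touches == 0 else 0.0             # untouched cases are risky
    churn = min(case.reassignments / 3.0, 1.0)              # hand-offs predict delay
    return min(0.6 * time_used + 0.25 * stalled + 0.15 * churn, 1.0)

cases = [Case(40, 48, 0, 2), Case(5, 48, 3, 0)]
at_risk = [c for c in cases if breach_risk(c) > 0.7]  # flag for early intervention
print(len(at_risk))
```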
Maintaining context across long conversations without losing track of what's been discussed or decided.
Detecting caller frustration or confusion from voice signals and adapting agent behaviour accordingly.
Conference calls with AI agents - handling multiple speakers and turn-taking between humans and AI.
Research that doesn't ship isn't useful. We balance exploration with practical constraints - everything we build needs to work in production.
No vendor lock-in. Clean interfaces let us swap AI providers without changing application code. When a better model emerges, we can test it on production traffic.
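The pattern can be sketched with a small interface that application code depends on. Provider names and methods here are illustrative stand-ins, not our actual integration layer:

```python
# Sketch of the clean-interface idea: application code depends on a
# narrow protocol, never on a vendor SDK directly.
from typing import Protocol

class ChatModel(Protocol):
    def complete(self, prompt: str) -> str: ...

class ProviderA:
    def complete(self, prompt: str) -> str:
        return f"[provider-a] {prompt}"

class ProviderB:
    def complete(self, prompt: str) -> str:
        return f"[provider-b] {prompt}"

def answer(model: ChatModel, question: str) -> str:
    # Because the caller only sees ChatModel, swapping providers (or
    # A/B testing a new model on live traffic) is a wiring change.
    return model.complete(question)

print(answer(ProviderA(), "hello"))
print(answer(ProviderB(), "hello"))
```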
Voice AI can't wait for complete responses. Audio streams to the transcriber while the caller is still speaking, partial transcripts stream to the LLM before the utterance is finished, and responses stream to TTS as tokens are generated.
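A toy version of that pipeline, with string-based stand-ins where a real system would wrap ASR, LLM, and TTS APIs:

```python
# Each stage consumes a stream and yields output before its input is
# complete - no stage waits for the previous one to finish.
from typing import Iterator

def transcribe(audio_chunks: Iterator[str]) -> Iterator[str]:
    for chunk in audio_chunks:      # partial transcripts, not one final result
        yield chunk

def generate(transcript_parts: Iterator[str]) -> Iterator[str]:
    for part in transcript_parts:   # start generating before the utterance ends
        yield f"ack:{part}"

def synthesise(tokens: Iterator[str]) -> Iterator[bytes]:
    for token in tokens:            # audio goes out as soon as tokens arrive
        yield token.encode()

audio = iter(["hel", "lo ", "there"])
for packet in synthesise(generate(transcribe(audio))):
    print(packet)
```

Because every stage is a generator, the first audio packet leaves the last stage while the first one is still reading input, which is where the latency savings come from.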
When providers fail or audio quality degrades, the system adapts. Fallback to simpler models, ask for clarification, or escalate to humans - never just fail silently.
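One way to sketch that degradation chain; the handler names and failure conditions are illustrative, not our production logic:

```python
# Ordered fallback chain: each handler returns a reply or raises to pass
# control to the next. The final return guarantees no silent failure.
def primary_model(utterance: str) -> str:
    raise TimeoutError("provider unavailable")    # simulate an outage

def simple_model(utterance: str) -> str:
    if len(utterance.split()) > 3:
        raise ValueError("too complex for the fallback model")
    return f"simple reply to: {utterance}"

def escalate_to_human(utterance: str) -> str:
    return "Transferring you to a colleague who can help."

def respond(utterance: str) -> str:
    for handler in (primary_model, simple_model, escalate_to_human):
        try:
            return handler(utterance)
        except Exception:
            continue                 # fall through to the next handler
    return "Sorry, something went wrong. Please call back."
```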
Every conversation generates metrics. Latency, accuracy, user satisfaction, task completion. You can't improve what you don't measure.
Problems we haven't solved yet. If you have ideas or have tackled similar challenges, we'd love to hear from you.
Simple silence detection doesn't work. Falling intonation helps but isn't universal. Linguistic completeness helps but requires understanding. We're combining signals but it's still imperfect.
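The signal-combination idea can be sketched as a weighted score; the weights and threshold below are illustrative, not tuned values from our system:

```python
# Blend acoustic and linguistic evidence into one end-of-turn score.
def end_of_turn_score(silence_ms: float,
                      pitch_falling: bool,
                      utterance_complete: bool) -> float:
    """Return a 0-1 score; above threshold, the agent may speak."""
    silence = min(silence_ms / 700.0, 1.0)         # silence alone is unreliable
    prosody = 1.0 if pitch_falling else 0.0        # helps, but not universal
    syntax = 1.0 if utterance_complete else 0.0    # needs an LM to judge
    return 0.4 * silence + 0.25 * prosody + 0.35 * syntax

SPEAK_THRESHOLD = 0.6
# A mid-sentence pause: long silence, but no falling pitch, clause incomplete.
print(end_of_turn_score(800, False, False) >= SPEAK_THRESHOLD)
# A short pause after a complete, falling-intonation sentence.
print(end_of_turn_score(400, True, True) >= SPEAK_THRESHOLD)
```

The first case stays below the threshold despite the long silence, which is the behaviour a silence-only detector gets wrong.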
Full automation is risky for high-stakes decisions. Full human review doesn't scale. The boundary is context-dependent and we're still learning where to draw it.
Long conversations exceed context windows. Summarisation loses detail. Retrieval adds latency. There's no perfect solution, only trade-offs.
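One common compromise, sketched here with a string-concatenation stand-in for the summarisation step: keep recent turns verbatim and fold older turns into a running summary.

```python
# Rolling memory: recent turns stay verbatim; evicted turns are folded
# into a summary. summarise() stands in for an LLM call - the truncation
# it does is exactly the detail loss described above.
from collections import deque

RECENT_TURNS = 4

def summarise(old_summary: str, turn: str) -> str:
    return (old_summary + " | " + turn)[:200]

class ConversationMemory:
    def __init__(self) -> None:
        self.summary = ""
        self.recent: deque[str] = deque(maxlen=RECENT_TURNS)

    def add(self, turn: str) -> None:
        if len(self.recent) == RECENT_TURNS:
            self.summary = summarise(self.summary, self.recent[0])  # evict oldest
        self.recent.append(turn)

    def prompt_context(self) -> str:
        return f"Summary: {self.summary}\nRecent: {list(self.recent)}"
```

The knobs are visible in the sketch: a bigger verbatim window costs context tokens, a more aggressive summary loses decisions made early in the call.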
Automated metrics catch technical issues but miss naturalness. Human evaluation doesn't scale. We're exploring hybrid approaches.
Detailed writeups of problems we've solved and approaches we've taken.
If these challenges sound interesting, we're hiring. Small team, real problems, code that ships to production.