All notable changes to this project will be documented here.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
[Unreleased]
[0.0.1] — 2025-02-18
Added
- Core
Utteranceclass with event-driven API (speechStart,pause,turnEnd,interrupt) TurnDetectorstate machine for turn-taking logic with configurable sensitivity and pause toleranceAudioCapturemodule using the Web Audio API (microphone access, real-time audio processing)FeatureExtractorwith energy computation and stubs for MFCCs, pitch, and speech rateONNXModelmodule for ONNX Runtime Web inferenceEnergyVADbaseline classifier using RMS energy thresholds as fallback when no ONNX model is loaded- Dual-format build: ESM (
dist/index.js) and CommonJS (dist/index.cjs) with TypeScript declarations - Shared type definitions in
src/types.ts - Documentation site at utterance.dev with quick start, API reference, and architecture guides
- Interactive playground at
/playgroundfor real-time event testing - 11 passing tests for the detector and feature extractor