Lightweight observation instead of artificial precision.
VDMS-Light is a compact observational framework describing how synthetic voices behave under tonal, social, emotional and linguistic drift.
Vectors are observational markers.
Arcadia does not treat voice vectors as objective scientific measurements. The values are lightweight listening markers used to describe perceived tendencies inside a synthetic identity.
The system intentionally avoids excessive mathematical complexity. Its purpose is not biometric certainty, but practical comparison, drift observation and prompt distillation.
A small number of stable dimensions.
Warmth & clarity
Describes resonance behavior, tonal softness, intelligibility and perceived vocal density.
Movement & timing
Observes pacing, pause structure, melodic motion and rhythmic tightness.
Presence & distance
Describes conversational closeness, social pressure and perceived human proximity.
Tension & reflection
Tracks subtle emotional pressure without reducing voices to simple emotional labels.
JESPER-DE source profile.
The current source profile is continuously refined through comparative listening and derivative testing.
JESPER_SOURCE: warmth: 68 clarity: 82 breathiness: 7 rhythm_tightness: 79 melodic_motion: 12 social_proximity: 24 internationality: 91
Voice systems interpret language more reliably than vectors.
Arcadia does not assume that systems such as ElevenLabs Voice Remixing understand vector values as direct commands. Internal vectors are therefore translated into compact natural-language prompts.
Observation → Distillation → Prompt
VECTOR SHIFT: melodic_motion: -10 social_proximity: -8 breathiness: -4 DISTILLED PROMPT: Restrained melodic phrasing, composed social distance, minimal breath texture, while preserving the calm core identity.
Some values interact unexpectedly.
Strong melodic movement can rapidly alter perceived regional identity and social character.
Increased conversational closeness often destabilizes otherwise stable source identities.
Small breath adjustments can strongly influence intimacy and emotional realism.
Rhythmic control affects perceived authority, narrative structure and international intelligibility.
VDMS-Light is intentionally incomplete.
The framework does not claim exhaustive modeling of human voice identity. Some effects remain unstable, context-dependent or difficult to separate cleanly.
The goal is practical orientation and controlled comparison — not total abstraction of the human voice.
Listening remains more important than numbers.
VDMS-Light only becomes meaningful through repeated listening and contextual interpretation. Human evaluation remains necessary to determine whether a derivative still feels plausible, stable and culturally coherent.
