CREATOR DNA™ · LONGITUDINAL IDENTITY IN MUSIC AND VOICE
Identity is not a snapshot.
It is a pattern under constraint.
Identity is not similarity.
It is how a signal behaves within constraints over time.
— Scot Sier · Musician · Sonic Systems Inventor · Creator DNA Originator
As AI-generated music and voice become indistinguishable from human output, the problem is no longer creation. It's verification.
ABOUT THE ARTIST
Scot Sier
Scot Sier has been making music for as long as he can remember.
A violin prodigy through elementary school, following in the footsteps of his Indiana state champion violinist father and grandfather, he competed and won — until the day he heard Jimi Hendrix play guitar. That moment didn’t end his musical career. It expanded the boundaries of creative exploration.
What followed was a lifetime of work spanning from power-pop to experimental noise collaborations with musicians across the globe — not as phases, but as genuine, fully-realized creative expressions, each rooted in a different world of sound. The classical foundation never left. It went underground, showing up in places most listeners wouldn’t expect to find it.
That’s the nature of a deep musical gift. It doesn’t stay in one genre. It travels.
His unique musical style and use of non-standard guitar tunings is what inspired Creator DNA. The loss of his mother when he was a teenager deeply affected his creative work, where he focused more on collaboration and human rights in his body of work. That is the core of his invention — it redefines and empowers the artist by creating a sonic footprint that, even if copied by AI, can stand up against deepfake replication by showing each person has a unique style over time that can’t be accurately reproduced by artificial means.
The six-song test across six radically different genres — each individually analyzed by Creator DNA biometrics — shows how Scot’s sonic biometric fingerprint is consistent in every song. The structural instincts, compositional behavior, and creative logic that define his work are an artist proof of identity that establishes a baseline of authentication against deepfakes, an infrastructure system for music distribution and ownership rights in the AI era.
THE IDEA
Most systems ask: “Does this sound like X?”
Creator DNA asks: “Does this behave like X over time?”
Current AI classification is built on similarity — comparing surface features against known catalogs. Creator DNA operates at a different layer, measuring how a signal is organized across time, not how it appears at any single moment.
CURRENT SYSTEMS
“Does this sound like X?”
Similarity-based · Surface features · Can be imitated · Fails across genre transformation
CREATOR DNA
“Does this behave like X over time?”
Constraint-based · Behavioral layer · Longitudinal · Significantly harder to reproduce
THE EXPERIMENT
Six songs. Six genres. One creator.
A controlled dataset designed to stress-test identity. Six original compositions pushing into completely different stylistic spaces. If authorship identity were based on similarity, these tracks should appear unrelated. They are not.
6
ORIGINAL TRACKS
Power-pop · Roots-pop · Glam rock · Gypsy punk · Space rock · Acid-jazz-punk
6
GENRE CLUSTERS
Each classified by AI as belonging to a different artist lineage
1
CREATOR
All compositions, performances, and recordings: Scot Sier
SIX SONGS
Surface Difference, Structural Continuity
Listen to each track. The recordings are genuinely different in style, energy, and arrangement. Then look at the spectral data below each one.
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
2183 Hz
spectral centroid
Energy Variability
0.218
RMS CV
Harmonic Constraint
4.170
chroma entropy
What you're hearing
A high-energy recording with strong upper-mid presence and wide spectral spread. One of the brightest profiles in the dataset.
Signal observation
The spectrogram shows elevated energy across higher frequency bands with consistent rhythmic drive — confirming strong surface divergence.
Identity layer
Despite its brightness and density, harmonic behavior falls within the same constrained range observed across all six recordings. Identity persisting under high forward energy.
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
1839 Hz
spectral centroid
Energy Variability
0.210
RMS CV
Harmonic Constraint
4.258
chroma entropy
What you're hearing
A relaxed, narrative-driven performance with acoustic emphasis and wider tempo variability. Organic phrasing and open arrangement.
Signal observation
Reduced high-frequency intensity and more open spacing between harmonic events, consistent with a stripped-down acoustic arrangement.
Identity layer
Even with its looser feel and reduced spectral density, harmonic complexity remains within the same narrow band — authorship identity independent of production density.
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
1389 Hz
spectral centroid
Energy Variability
0.372
RMS CV
Harmonic Constraint
4.307
chroma entropy
What you're hearing
A dynamically shifting arrangement with complex chord movement and tonal contrast. The most dynamically elastic recording in the set.
Signal observation
Denser midrange with evolving harmonic clusters and the broadest dynamic variation across time — a darker, more theatrical spectral profile.
Identity layer
Despite its intricate structure and tonal shifts, harmonic entropy stays tightly bounded — complex arrangements following the same underlying compositional constraints.
04
Cocktail For The Blue
Gypsy Punk
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
2787 Hz
spectral centroid
Energy Variability
0.217
RMS CV
Harmonic Constraint
4.377
chroma entropy
What you're hearing
Fast articulation, rhythmic accents, and abrupt transitions between structured and energetic passages. The brightest spectral centroid in the dataset.
Signal observation
Rapid transient activity and alternating density patterns reflecting a hybrid rhythmic structure — ska pulse beneath punk energy.
Identity layer
Even with its rhythmic volatility and genre blending, harmonic behavior remains consistent with the rest of the dataset — identity persisting across structural extremes.
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
2014 Hz
spectral centroid
Energy Variability
0.256
RMS CV
Harmonic Constraint
4.381
chroma entropy
What you're hearing
An atmospheric recording with sustained tones and gradual harmonic transitions. The slowest tempo in the set at 89 BPM.
Signal observation
Smoother, more continuous energy distribution with less abrupt transient activity — a softer, more expansive spectral profile.
Identity layer
Despite its slower evolution and softer profile, harmonic entropy still falls within the same constrained range — stability demonstrated at low intensity as well as high.
SPECTRAL FINGERPRINT — time → frequency → energy
Brightness
1874 Hz
spectral centroid
Energy Variability
0.135
RMS CV
Harmonic Constraint
4.291
chroma entropy
What you're hearing
A rhythm-forward track with strong attack transients and defined structural sections. The most dynamically locked recording in the set.
Signal observation
Sharp, repeated bursts of energy corresponding to rhythmic hits and a compressed, punchy dynamic profile throughout.
Identity layer
Even with its aggressive rhythmic character, harmonic behavior remains within the same narrow band — identity preserved under high-energy percussive conditions.
FINDINGS
Real divergence on the surface.
Tight convergence underneath.
2.0×
BRIGHTNESS VARIATION
Spectral centroid: 1389 – 2787 Hz
2.2×
TEMPO VARIATION
89 – 199 BPM across the dataset
2.7×
DYNAMICS VARIATION
RMS CV range: 0.135 – 0.372
KEY FINDING — THE IDENTITY LAYER
Across all six tracks, one signal remained tightly consistent: chroma entropy — how harmonic energy is distributed across the chromatic scale.
The coexistence of high divergence and low entropy variance indicates a constraint-based generative system rather than genre-dependent composition. A similarity-based system cannot detect this constraint, because it evaluates surface features rather than the distribution rules governing those features.
The strongest measurable signal in this dataset is the constrained chroma entropy band. Style changes. Production changes. Genre changes. But the underlying behavioral system does not.
0.211
TOTAL ENTROPY RANGE
2.0%
COEFFICIENT OF VARIATION
CHROMA ENTROPY PER TRACK — ALL SIX FALL WITHIN THE GREEN BAND
Shannon entropy (base e) of normalized chroma vectors, averaged over frames. Scale: 3.9 – 4.6.
PROOF
AI can copy what something sounds like.
Creator DNA measures what something is.
A voice can be cloned. A style can be imitated. But origin cannot be verified by similarity alone. Creator DNA introduces longitudinal identity verification — evaluating whether a signal stays within the same behavioral constraints across time.
AI SYSTEMS TODAY CAN REPLICATE
- Tone and timbre
- Phrasing and rhythm
- Genre conventions
- Stylistic surface features
WHAT REMAINS HARDER TO REPRODUCE
- Constraint-based behavioral identity
- Longitudinal harmonic distribution
- Authorship signal across genres
- Origin — not just appearance
CREATOR DNA APPLIES TO
- Music authorship verification
- Voice authentication
- AI-generated content detection
- Content provenance
NEXT STEPS FOR VALIDATION
- Larger datasets
- Cross-creator comparison baseline
- Blind testing protocols
- Independent replication
HARMONIC IDENTITY LAYER
A Second Independent Signal.
Compositional Identity.
Chroma entropy measures compositional identity. This layer measures something different — the harmonic behavioral signature extracted from each recording: how melodic energy is voiced, how timbral character distributes across frequency, how pitch behavior organizes over time.
Two independent analytical layers. Six different genres. One consistent origin signal.
17.3%
F0 COEFFICIENT OF VARIATION
Fundamental pitch consistency across genres
14.5%
TIMBRAL IDENTITY CV
Harmonic envelope shape across all six recordings
91–96%
VOICED RATIO — 5 OF 6 TRACKS
Consistent melodic presence behavior
VOICED RATIO PER TRACK — MELODIC PRESENCE CONSISTENCY
Proportion of harmonic signal carrying melodic identity. Scale: 0.80 – 1.00. Green band = consistency window.
TIMBRAL IDENTITY (MFCC2) — HARMONIC ENVELOPE SIGNATURE
Spectral shape of harmonic layer averaged across full recording. Scale: 80 – 180.
KEY FINDING — HARMONIC IDENTITY LAYER
Across six radically different genres, two independent signal layers show the same result: one behavioral origin.
The harmonic identity analysis — extracted independently from the compositional chroma analysis — shows consistent timbral character (14.5% CV), consistent fundamental pitch behavior (17.3% CV), and consistent melodic presence across five of six recordings. These are not the same measurement as chroma entropy. They are measuring different properties of the signal. They are converging on the same conclusion.
When two independent analytical layers both show tight consistency across six different genres, the probability of that being coincidental collapses. This is what a behavioral identity signal looks like.
2
INDEPENDENT SIGNAL LAYERS
ANALYTICAL NOTE
This layer uses harmonic-percussive source separation (HPSS) to extract the melodic/harmonic component of each recording, then applies MFCC timbral analysis, pYIN fundamental frequency estimation, and voiced ratio detection. This measures the combined harmonic behavioral signature of voice and melodic instrumentation. True isolated vocal biometrics (formant analysis F1/F2/F3) would constitute an additional third identity layer. Analysis performed using librosa 0.11.0.
VOCAL BIOMETRIC LAYER
A Third Independent Signal.
Your Vocal Tract. Your Identity.
Formant frequencies are determined by the physical shape and length of your vocal tract — your throat, mouth, and nasal cavity. They are anatomical constants. They do not change with genre, style, emotion, or intent. They cannot be trained away or imitated with precision.
Four isolated vocal tracks across four different genres were analyzed using Praat acoustic phonetics software — the same tool used in forensic voice analysis and clinical speech research. The results speak for themselves.
4.0%
F3 VARIATION
Vocal tract length signature
5.0%
F2 VARIATION
Tongue resonance position
8.6%
F1 VARIATION
Vowel space / jaw signature
9.4%
F0 VARIATION
Fundamental pitch consistency
F3 — VOCAL TRACT LENGTH SIGNATURE (most anatomically fixed formant)
Hz. Scale: 2600 – 3200. 4.0% coefficient of variation across four genres.
F2 — TONGUE RESONANCE POSITION
Hz. Scale: 1500 – 2100. 5.0% coefficient of variation across four genres.
F1 — VOWEL SPACE / JAW HEIGHT SIGNATURE
Hz. Scale: 600 – 1000. 8.6% coefficient of variation across four genres.
PER-TRACK VOCAL BIOMETRICS
| TRACK |
F0 |
F1 |
F2 |
F3 |
HNR |
| The Clown |
192.7 |
743 |
1830 |
2835 |
16.8dB |
| Time Out |
218.9 |
702 |
1697 |
2788 |
11.3dB |
| Rusty Cadillac |
243.4 |
884 |
1893 |
2953 |
17.8dB |
| Colonizer |
197.9 |
781 |
1943 |
3087 |
15.1dB |
WHY F3 MATTERS MOST
F3 — the third formant — is primarily determined by the length of your vocal tract from glottis to lips. It is the most anatomically fixed of all formants. It does not change based on what you are singing, how you are singing it, or what genre surrounds it.
Across Glam Rock, Acid-Jazz-Punk, Roots-Pop, and Space Rock, F3 ranges from 2788 to 3087 Hz — a 4.0% window. This is a vocal tract signature. It belongs to one person.
KEY FINDING — THREE INDEPENDENT IDENTITY LAYERS
Three separate analyses. Three different measurement systems. One origin.
Chroma entropy measures compositional behavior. Harmonic MFCC analysis measures timbral identity. Formant analysis measures vocal tract anatomy. These are not variations of the same test. They are independent signals drawn from fundamentally different properties of the recordings. All three converge on the same conclusion.
This is what Creator DNA is built on — not one signal, but the convergence of multiple independent behavioral and anatomical layers into a single verifiable origin point. That convergence is what AI cannot replicate. That convergence is the future of authentication.
2.0%
CHROMA ENTROPY VARIANCE
ANALYTICAL NOTE
Vocal analysis performed on four isolated vocal tracks using Praat acoustic phonetics software via parselmouth 0.4.x (Python). Formant extraction used Burg method, 5 formants, 5500Hz ceiling, 25ms window. Pitch analysis: 75–600Hz range. HNR: harmonic-to-noise ratio (cc method). Analysis covers The Clown (Glam Rock), Time Out (Acid-Jazz-Punk), Rusty Cadillac (Roots-Pop), Colonizer (Space Rock). Isolated vocal tracks provided by artist.
These six tracks are different. But they are not from different creators.
Current AI systems would get that wrong. They listen for similarity. They compare surfaces. In a world where any voice can be cloned and any style can be imitated, that approach has already failed.
Creator DNA Biometric behavioral analysis asks how a signal behaves over time, and the answer to that question is, origin.
This is the future of authentication. Not just in music. In every creative field where human identity must be distinguished from artificial reproduction.
CREATOR DNA™ · SCOT SIER · MUSICIAN · SONIC SYSTEMS INVENTOR · CREATOR DNA ORIGINATOR
Early-stage proof-of-concept · n=6 tracks · Sample data · Descriptive findings only · Not peer-reviewed · Not a legal instrument
© Scot Sier. All compositions, performances, and recordings by Scot Sier.