Echoes of WHALE FALL
Machine learning and emotional analysis
audio-visual system
Echoes of Whale Fall
Echoes of Whale Fall is an emotion-driven audiovisual system that uses machine learning to analyze emotional signals, transforming deep-sea sounds and imagery into a generative narrative of loss, memory, and rebirth.
Team
Lilith Ren
Location
Harvard GSD
Dates
25 Spring
Machine Learning

Dataset & Source Material
A curated dataset of deep-sea whale footage and acoustic recordings forms the foundation of the system.
These clips are segmented into narrative moments such as emergence, movement, and collective behavior, providing structured input for emotional annotation.
Framework
Narrative Framework to Emotion Driven Visualization System to Generative Audio-Visual Environment





Machine Learning Process
Signal Processing & Abstraction
Raw audiovisual data is decomposed into multiple layers of information, including motion patterns, intensity shifts, and temporal rhythms. This abstraction process translates natural phenomena into analyzable signals that can be computationally interpreted.

Storyboard of Whale Fall



Emotional Mapping (ML Layer)
Each segment is annotated using a three-dimensional affective model—arousal, dominance, and valence—creating an emotional dataset. This structured mapping enables machine learning to operate as a bridge between sensory input and generative output.


Emotional Analysis and Sound Pipeline
The emotional data drives a non-linear sequencing system, where transitions between states (e.g., silence, emergence, loss) are dynamically arranged. Rather than a fixed timeline, the narrative evolves through emotional continuity and variation.


Generative Visual Output
Machine learning outputs are translated into visual forms, where parameters such as density, motion, and color respond to emotional states. This results in an evolving visual language that
reflects the underlying affective structure of the data.


System Implementation
The system is implemented in Python, integrating audio processing and rule-based generation.
Machine learning here functions not as prediction, but as a mediating framework that continuously maps emotional inputs to audiovisual behaviors.