
Jose 5a8dc9c489 fix 🐛: Update bin range calculation and formatting in src/main.cpp

- The code provided is a JavaScript application that visualizes frequency data from a WebSocket connection. It uses the Chart.js library to create an interactive graph that displays the spectrum of incoming audio. The chart includes annotations for specific frequencies and tooltips with detailed frequency information.

Here's a breakdown of the key components:

1. **Note Frequencies**:
- `noteFrequencies` is an object containing note names as keys and their corresponding frequencies in Hz.

2. **Chart Setup**:
- `chart` initializes a Chart.js chart with the specified type, dimensions, options, plugins, data, and animation.
- `chart.update()` updates the chart to reflect any changes in the dataset.

3. **WebSocket Connection**:
- A WebSocket connection (`ws`) is established to receive frequency data from a server running on the same hostname as the client.
- The `onmessage` event handler receives JSON-encoded data, which represents the frequency spectrum.
- The received spectrum data is interpolated onto a logarithmic frequency scale, which spreads out the low-frequency region where musical notes sit close together.

4. **Interpolation**:
- The `interpolateSpectrum` function converts the linear frequency bins of the spectrum to logarithmic scale and interpolates between the closest two bins for a smoother display.
- This interpolation helps in better handling of low-frequency components that may not be accurately represented by simple linear steps.

5. **Legend Disabling**:
- The `legend: { display: false }` option disables the legend in the chart, which is useful when only a single dataset is displayed and a legend would add clutter.

This application provides a dynamic way to visualize audio frequency data, allowing users to interact with the spectrum and gain insights into the content of the incoming audio.
- The changes introduced in `src/main.cpp` are related to the calculation and formatting of FFT bin ranges for frequency detection, specifically for the frequency range from 60 Hz to 1100 Hz.

2025-04-25 19:52:45 +02:00

data

fix 🐛: Update bin range calculation and formatting in src/main.cpp

2025-04-25 19:52:45 +02:00

include

feat(data) undefined: Added new HTML file for piano spectrum analyzer. fix(Config.h): Updated WiFi settings and web configuration portal details. refactor(Arduino sketch): Added WiFi and WebSocket support, enabling real-time spectrum data transmission over the

2025-04-25 19:00:28 +02:00

lib

feat ✨: Added .gitignore, README files, partitions, build options, dependencies, and configuration for ESP32-S3 development board using Arduino.

2025-04-18 18:23:25 +02:00

src

fix 🐛: Update bin range calculation and formatting in src/main.cpp

2025-04-25 19:52:45 +02:00

test

feat ✨: Added .gitignore, README files, partitions, build options, dependencies, and configuration for ESP32-S3 development board using Arduino.

2025-04-18 18:23:25 +02:00

.gitignore

feat ✨: Added .gitignore, README files, partitions, build options, dependencies, and configuration for ESP32-S3 development board using Arduino.

2025-04-18 18:23:25 +02:00

partitions.csv

feat ✨: Added .gitignore, README files, partitions, build options, dependencies, and configuration for ESP32-S3 development board using Arduino.

2025-04-18 18:23:25 +02:00

platformio.ini

feat undefined: Updated Arduino project with new features for audio processing. fix: Updated I2SConfig.h to include additional parameter for reading I2S samples. refactor: Improved performance by rounding integer samples to

2025-04-25 12:14:06 +02:00

README.md

2025-04-25 12:14:06 +02:00

ESP32 Piano Note Detection System

A real-time piano note detection system implemented on ESP32 using I2S microphone input. This system can detect musical notes from C2 to C6 with adjustable sensitivity and visualization options.

Features

Real-time audio processing using I2S microphone
FFT-based frequency analysis
Note detection from C2 (65.41 Hz) to C6 (1046.50 Hz)
Dynamic threshold calibration
Multiple note detection (up to 7 simultaneous notes)
Harmonic filtering
Real-time spectrum visualization
Note timing and duration tracking
Interactive Serial commands for system tuning

Hardware Requirements

ESP32 development board (the build configuration in this repository targets an ESP32-S3)
I2S MEMS microphone (e.g., INMP441, SPH0645)
USB connection for Serial monitoring

Pin Configuration

The system uses the following I2S pins by default (configurable in Config.h):

SCK (Serial Clock): GPIO 8
WS/LRC (Word Select/Left-Right Clock): GPIO 9
SD (Serial Data): GPIO 10

Getting Started

Connect the I2S microphone to the ESP32 according to the pin configuration
Build and flash the project to your ESP32
Open a Serial monitor at 115200 baud
Follow the calibration process on first run

Serial Commands

The system can be controlled via Serial commands:

h - Display help menu
c - Start calibration process
+ - Increase sensitivity (lowers the detection threshold)
- - Decrease sensitivity (raises the detection threshold)
s - Toggle spectrum visualization

Configuration Options

All system parameters can be adjusted in Config.h:

Audio Processing

SAMPLE_RATE: 8000 Hz (Nyquist limit of 4 kHz, well above the 1046.50 Hz top note)
BITS_PER_SAMPLE: 16-bit resolution
SAMPLE_BUFFER_SIZE: 1024 samples
FFT_SIZE: 1024 points

Note Detection

NOTE_FREQ_C2: 65.41 Hz (lowest detectable note)
NOTE_FREQ_C6: 1046.50 Hz (highest detectable note)
FREQUENCY_TOLERANCE: 3.0 Hz
MAX_SIMULTANEOUS_NOTES: 7
MIN_NOTE_DURATION_MS: 50ms
NOTE_RELEASE_TIME_MS: 100ms
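
As an illustration of how a detected frequency could be matched against the C2 to C6 range with FREQUENCY_TOLERANCE, here is a minimal sketch. It assumes equal temperament with A4 = 440 Hz; the `NoteMatch` struct and `matchNote` function are hypothetical names, not taken from the project sources.

```cpp
#include <cassert>
#include <cmath>
#include <string>

// Result of matching a frequency against the equal-tempered scale.
struct NoteMatch {
    bool matched;       // true if within tolerance of the nearest note
    std::string name;   // e.g. "A4"; empty when out of range
    float noteFreqHz;   // exact frequency of the nearest note
};

// Hypothetical helper: maps a frequency to the nearest note in C2..C6.
NoteMatch matchNote(float freqHz, float toleranceHz = 3.0f) {
    assert(freqHz > 0.0f);
    static const char* kNames[12] = {"C", "C#", "D", "D#", "E", "F",
                                     "F#", "G", "G#", "A", "A#", "B"};
    // MIDI note number: 69 is A4, 12 semitones per octave.
    int midi = static_cast<int>(
        std::lround(69.0 + 12.0 * std::log2(freqHz / 440.0)));
    // MIDI 36 is C2 and MIDI 84 is C6; anything else is out of range.
    if (midi < 36 || midi > 84) return {false, "", 0.0f};

    float noteFreq = 440.0f * std::pow(2.0f, (midi - 69) / 12.0f);
    int octave = midi / 12 - 1;  // MIDI 60 is C4
    std::string name = std::string(kNames[midi % 12]) + std::to_string(octave);
    bool ok = std::fabs(freqHz - noteFreq) <= toleranceHz;
    return {ok, name, noteFreq};
}
```

With these assumptions, `matchNote(440.0f)` yields A4 and `matchNote(659.3f)` yields E5, while a peak at 452 Hz is rejected because it is more than 3 Hz away from A4.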

Calibration

CALIBRATION_DURATION_MS: 5000ms
CALIBRATION_PEAK_PERCENTILE: 0.95 (95th percentile)
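
For orientation, the parameters listed above could be declared along these lines. This is only a sketch of what Config.h might contain: the values come from this README, the types are guesses, and the three pin constant names are hypothetical.

```cpp
#include <cstddef>
#include <cstdint>

// I2S pins (names are hypothetical; values from "Pin Configuration").
constexpr int I2S_SCK_PIN = 8;
constexpr int I2S_WS_PIN  = 9;
constexpr int I2S_SD_PIN  = 10;

// Audio processing
constexpr std::uint32_t SAMPLE_RATE        = 8000;
constexpr std::uint8_t  BITS_PER_SAMPLE    = 16;
constexpr std::size_t   SAMPLE_BUFFER_SIZE = 1024;
constexpr std::size_t   FFT_SIZE           = 1024;

// Note detection
constexpr float         NOTE_FREQ_C2           = 65.41f;
constexpr float         NOTE_FREQ_C6           = 1046.50f;
constexpr float         FREQUENCY_TOLERANCE    = 3.0f;
constexpr int           MAX_SIMULTANEOUS_NOTES = 7;
constexpr std::uint32_t MIN_NOTE_DURATION_MS   = 50;
constexpr std::uint32_t NOTE_RELEASE_TIME_MS   = 100;

// Calibration
constexpr std::uint32_t CALIBRATION_DURATION_MS     = 5000;
constexpr float         CALIBRATION_PEAK_PERCENTILE = 0.95f;

// Compile-time sanity checks on the combination of values.
static_assert((FFT_SIZE & (FFT_SIZE - 1)) == 0,
              "FFT_SIZE must be a power of two");
static_assert(NOTE_FREQ_C6 < SAMPLE_RATE / 2.0f,
              "highest note must lie below the Nyquist frequency");
```

The `static_assert` lines are an optional safeguard: they make the build fail early if someone changes the sample rate or FFT size to an incompatible value.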

Visualization

The system provides two visualization modes:

Note Display:

Current Notes:
A4 (440.0 Hz, Magnitude: 2500, Duration: 250ms)
E5 (659.3 Hz, Magnitude: 1800, Duration: 150ms)

Spectrum Display (when enabled):

Frequency Spectrum:
0Hz    |▄▄▄▄▄
100Hz  |██████▄
200Hz  |▄▄▄
...
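
A row of the spectrum display could be produced roughly as follows. This is a sketch, not the project's SpectrumVisualizer: `formatSpectrumRow` is a hypothetical function, and the scaling (bar length proportional to magnitude, a trailing ▄ for a half step) is an assumption based on the sample output above.

```cpp
#include <cassert>
#include <string>

// Formats one spectrum row such as "100Hz  |██▄".
// width is the number of full blocks drawn for magnitude == maxMagnitude.
std::string formatSpectrumRow(int freqHz, float magnitude,
                              float maxMagnitude, int width = 20) {
    assert(maxMagnitude > 0.0f && width > 0);

    // Left-align the frequency label in a 7-character column.
    std::string label = std::to_string(freqHz) + "Hz";
    if (label.size() < 7) label.append(7 - label.size(), ' ');

    // Clamp the magnitude ratio to [0, 1].
    float ratio = magnitude / maxMagnitude;
    if (ratio < 0.0f) ratio = 0.0f;
    if (ratio > 1.0f) ratio = 1.0f;

    // Work in half-block steps so short bars keep some resolution.
    int halfSteps = static_cast<int>(ratio * width * 2 + 0.5f);

    std::string row = label + "|";
    for (int i = 0; i < halfSteps / 2; ++i) row += "█";
    if (halfSteps % 2 == 1) row += "▄";
    return row;
}
```

Printing each row with `Serial.println(formatSpectrumRow(...).c_str())` would give output in the style shown above.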

Performance Tuning

Start with calibration by pressing 'c' in a quiet environment
Play notes and observe the detection accuracy
Use '+' and '-' to adjust sensitivity if needed
Enable spectrum display with 's' to visualize frequency content
Adjust Config.h parameters if needed for your specific setup

Implementation Details

Uses FFT for frequency analysis
Implements peak detection with dynamic thresholding
Filters out harmonics to prevent duplicate detections
Tracks note timing and duration
Uses ring buffer for real-time processing
Calibration collects ambient noise profile
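
The ring buffer mentioned above can be pictured with the following sketch. It is a generic fixed-capacity buffer that overwrites the oldest samples, written for illustration; `SampleRingBuffer` is a hypothetical name and the project's actual buffer may differ (for example in how it synchronizes between tasks, which this sketch does not handle).

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Fixed-capacity ring buffer for 16-bit samples; the oldest data is overwritten.
template <std::size_t Capacity>
class SampleRingBuffer {
    static_assert(Capacity > 0, "Capacity must be positive");

public:
    // Appends one sample, overwriting the oldest one when full.
    void push(std::int16_t sample) {
        data_[head_] = sample;
        head_ = (head_ + 1) % Capacity;
        if (count_ < Capacity) ++count_;
    }

    std::size_t size() const { return count_; }

    // Copies the most recent n samples (oldest first) into out.
    // Returns false if fewer than n samples are available.
    bool copyLatest(std::int16_t* out, std::size_t n) const {
        assert(out != nullptr);
        if (n > count_) return false;
        std::size_t start = (head_ + Capacity - n) % Capacity;
        for (std::size_t i = 0; i < n; ++i) {
            out[i] = data_[(start + i) % Capacity];
        }
        return true;
    }

private:
    std::array<std::int16_t, Capacity> data_{};
    std::size_t head_ = 0;   // next write position
    std::size_t count_ = 0;  // number of valid samples
};
```

A consumer would typically push samples as they arrive from I2S and call `copyLatest` with the FFT size whenever a new analysis frame is due.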

Troubleshooting

No notes detected:
- Check microphone connection
- Run calibration
- Increase sensitivity with '+'
- Verify audio input level in spectrum display
False detections:
- Run calibration in a quiet environment
- Decrease sensitivity with '-'
- Adjust PEAK_RATIO_THRESHOLD in Config.h
Missing notes:
- Check if notes are within C2-C6 range
- Increase FREQUENCY_TOLERANCE
- Decrease MIN_MAGNITUDE_THRESHOLD

Contributing

Contributions are welcome! Please read the contributing guidelines before submitting pull requests.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Development Environment Setup

Prerequisites

PlatformIO IDE (recommended) or Arduino IDE
ESP32 board support package
Required libraries:
- arduino-audio-tools
- arduino-audio-driver
- WiFiManager
- AsyncTCP
- ESPAsyncWebServer
- arduinoFFT

Building with PlatformIO

Clone the repository
Open the project in PlatformIO
Install dependencies:
```
pio pkg install
```
Build and upload:
```
pio run -t upload
```

Memory Management

Memory Usage

Program Memory: ~800KB
RAM Usage: ~100KB
DMA Buffers: 4 x 512 bytes
FFT Working Buffer: 2048 bytes (1024 samples x 2 bytes)

Optimization Tips

Adjust DMA_BUFFER_COUNT based on available RAM
Reduce SAMPLE_BUFFER_SIZE for lower latency
Use PSRAM if available for larger buffer sizes

Advanced Configuration

Task Management

Audio processing runs on Core 1
Main loop on Core 0
Configurable priorities in Config.h

Audio Pipeline

I2S DMA Input
Sample Buffer Collection
FFT Processing
Peak Detection
Note Identification
Output Generation

Timing Parameters

Audio Buffer Processing: ~8ms
FFT Computation: ~5ms
Note Detection: ~2ms
Total Latency: ~15-20ms (processing time only; a full 1024-sample buffer spans 128 ms of audio at 8 kHz)

Performance Optimization

CPU Usage

Audio Processing: ~30% on Core 1
Note Detection: ~20% on Core 1
Visualization: ~10% on Core 0

Memory Optimization

Buffer Size Selection:
- Larger buffers: Better frequency resolution
- Smaller buffers: Lower latency
DMA Configuration:
- More buffers: Better continuity
- Fewer buffers: Lower memory usage

Frequency Analysis

FFT Resolution: 7.8125 Hz (8000/1024)
Frequency Bins: 512 usable bins (0 Hz up to the 4 kHz Nyquist limit)
Useful Range: 65.41 Hz to 1046.50 Hz
Window Function: Hamming
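
The figures above follow from a few lines of arithmetic, sketched below. The function names are hypothetical; the constants are the SAMPLE_RATE and FFT_SIZE values from this README, and the window is the standard Hamming definition.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

constexpr float kSampleRateHz = 8000.0f;
constexpr std::size_t kFftSize = 1024;

// Width of one FFT bin in Hz: 8000 / 1024 = 7.8125.
constexpr float binResolutionHz() { return kSampleRateHz / kFftSize; }

// Center frequency of a bin.
inline float binToFrequency(std::size_t bin) {
    return bin * binResolutionHz();
}

// Nearest bin for a frequency between 0 Hz and the Nyquist limit.
inline std::size_t frequencyToBin(float freqHz) {
    assert(freqHz >= 0.0f && freqHz <= kSampleRateHz / 2.0f);
    return static_cast<std::size_t>(std::lround(freqHz / binResolutionHz()));
}

// Hamming window coefficient for sample n of an N-point frame.
inline float hammingCoefficient(std::size_t n, std::size_t N = kFftSize) {
    constexpr float kPi = 3.14159265358979f;
    return 0.54f - 0.46f * std::cos(2.0f * kPi * n / (N - 1));
}
```

With these values C2 (65.41 Hz) falls nearest to bin 8 and C6 (1046.50 Hz) nearest to bin 134. Note that neighboring notes at the low end are closer together than one bin (C2 and C#2 are about 3.9 Hz apart), so separating them likely needs more than a nearest-bin lookup.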

Technical Details

Microphone Specifications

Supply Voltage: 3.3V
Sampling Rate: 8kHz
Bit Depth: 16-bit
SNR: >65dB (typical)

Signal Processing

Pre-processing:
- DC offset removal
- Windowing function application
FFT Processing:
- 1024-point real FFT
- Magnitude calculation
Post-processing:
- Peak detection
- Harmonic filtering
- Note matching
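
Three of the stages listed above (DC offset removal, peak detection, harmonic filtering) can be sketched as follows. These are simplified illustrations with hypothetical function names, not the project's NoteDetector code; in particular the harmonic rule used here (drop any peak near an integer multiple of a lower kept peak) is an assumption.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <numeric>
#include <vector>

// Pre-processing: subtract the mean so a constant offset does not leak into bin 0.
void removeDcOffset(std::vector<float>& samples) {
    if (samples.empty()) return;
    float mean = std::accumulate(samples.begin(), samples.end(), 0.0f) /
                 samples.size();
    for (float& s : samples) s -= mean;
}

// Post-processing: indices of local maxima above the threshold.
std::vector<std::size_t> findPeaks(const std::vector<float>& mags,
                                   float threshold) {
    std::vector<std::size_t> peaks;
    for (std::size_t i = 1; i + 1 < mags.size(); ++i) {
        if (mags[i] > threshold && mags[i] > mags[i - 1] &&
            mags[i] >= mags[i + 1]) {
            peaks.push_back(i);
        }
    }
    return peaks;
}

// Harmonic filtering: keeps only peaks that are not close to 2x..maxHarmonic
// times a lower peak that was already kept. Input must be sorted ascending.
std::vector<float> filterHarmonics(const std::vector<float>& peakFreqsHz,
                                   float toleranceHz, int maxHarmonic = 8) {
    assert(toleranceHz >= 0.0f);
    std::vector<float> fundamentals;
    for (float f : peakFreqsHz) {
        bool isHarmonic = false;
        for (float base : fundamentals) {
            int k = static_cast<int>(std::lround(f / base));
            if (k >= 2 && k <= maxHarmonic &&
                std::fabs(f - k * base) <= toleranceHz) {
                isHarmonic = true;
                break;
            }
        }
        if (!isHarmonic) fundamentals.push_back(f);
    }
    return fundamentals;
}
```

One limitation of this simple rule: a genuinely played octave (for example A3 together with A4) is also removed, so a real detector would probably compare magnitudes as well before discarding a peak.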

Calibration Process

Ambient Noise Collection (5 seconds)
Frequency Bin Analysis
Threshold Calculation:
- Base threshold from 95th percentile
- Per-bin noise floor mapping
Dynamic Adjustment
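
The threshold calculation could look roughly like this. It is a sketch under stated assumptions: the base threshold is taken as the 95th percentile of the per-frame peak magnitudes, and the per-bin noise floor as the largest magnitude seen in each bin during calibration. The `percentile` and `calibrate` functions and the `CalibrationResult` struct are hypothetical names.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Value at percentile p (0..1), nearest-rank on a copy of the data.
float percentile(std::vector<float> values, float p) {
    assert(!values.empty() && p >= 0.0f && p <= 1.0f);
    std::size_t idx =
        static_cast<std::size_t>(p * (values.size() - 1) + 0.5f);
    std::nth_element(values.begin(), values.begin() + idx, values.end());
    return values[idx];
}

struct CalibrationResult {
    float baseThreshold;            // from the percentile of frame peaks
    std::vector<float> noiseFloor;  // one entry per frequency bin
};

// frames: magnitude spectra captured while the room is quiet.
CalibrationResult calibrate(const std::vector<std::vector<float>>& frames,
                            float peakPercentile = 0.95f) {
    assert(!frames.empty());
    const std::size_t bins = frames.front().size();
    assert(bins > 0);

    std::vector<float> framePeaks;
    std::vector<float> noiseFloor(bins, 0.0f);
    for (const auto& frame : frames) {
        assert(frame.size() == bins);
        framePeaks.push_back(*std::max_element(frame.begin(), frame.end()));
        for (std::size_t b = 0; b < bins; ++b) {
            noiseFloor[b] = std::max(noiseFloor[b], frame[b]);
        }
    }
    return {percentile(framePeaks, peakPercentile), noiseFloor};
}
```

At 8 kHz with 1024-sample frames, five seconds of ambient noise gives on the order of 39 non-overlapping frames to feed into such a function.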

Error Handling

Common Issues

I2S Communication Errors:
- Check pin connections
- Verify I2S configuration
- Monitor serial output for error codes
Memory Issues:
- Watch heap fragmentation
- Monitor stack usage
- Check DMA buffer allocation

Error Recovery

Automatic I2S reset on communication errors
Dynamic threshold adjustment
Watchdog timer protection

Project Structure

Core Components

AudioLevelTracker
- Real-time audio level monitoring
- Peak detection
- Threshold management
NoteDetector
- Frequency analysis
- Note identification
- Harmonic filtering
SpectrumVisualizer
- Real-time spectrum display
- Magnitude scaling
- ASCII visualization

File Organization

/src: Core implementation files
/include: Header files and configurations
/data: Web assets (HTML page for the piano spectrum analyzer)
/test: Unit tests