mannpatel/VoiceVault


Gaumit Kauts dc67cb8e31 Update README.md

2026-02-15 01:43:43 -07:00

backend

rag vector embeddings

2026-02-15 01:21:05 -07:00

database

Feat: frontend working almost

2026-02-14 23:00:13 -07:00

frontend

feat: audio playback working with bugs

2026-02-15 00:24:49 -07:00

.env

speech_to_text functionality updated

2026-02-14 19:10:03 -07:00

LICENSE

Initial commit

2026-02-14 10:45:01 -07:00

README.md

Update README.md

2026-02-15 01:43:43 -07:00

speech_to_text.py

speech_to_text functionality updated

2026-02-14 19:10:03 -07:00

README.md

VoiceVault

A schema-driven backend and frontend for archiving audio.

What It Does

Register / login users
Upload original audio/video to Supabase Storage bucket (archives)
Create audio_posts records in Postgres
Transcribe media locally with faster-whisper
Save transcript chunks to rag_chunks (with embeddings)
Build prompt context and store it in archive_metadata
Search user chunks with RAG endpoint (vector mode or text fallback)
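The "vector mode or text fallback" behaviour in the last item can be pictured with a small, self-contained sketch. This is not the code in backend/api_routes.py: the function names, the in-memory list of chunks, and the `text` / `embedding` keys are assumptions made for illustration, and the real backend queries rag_chunks in Postgres.

```python
import math
from typing import Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search_chunks(
    chunks: list,
    q: Optional[str] = None,
    query_embedding: Optional[Sequence[float]] = None,
    limit: int = 5,
) -> list:
    """Hypothetical helper: rank by vector similarity if an embedding is given,
    otherwise fall back to a case-insensitive substring match on the chunk text."""
    if query_embedding is not None:
        scored = [
            (cosine_similarity(query_embedding, c["embedding"]), c)
            for c in chunks
            # Skip chunks without an embedding or with a different dimension.
            if c.get("embedding") and len(c["embedding"]) == len(query_embedding)
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [c for _, c in scored[:limit]]
    if q:
        needle = q.lower()
        return [c for c in chunks if needle in c["text"].lower()][:limit]
    return []
```

Passing `query_embedding` selects vector mode; passing only `q` selects the text fallback, which mirrors the two query forms of the RAG search endpoint.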

Project Structure

backend/main.py Flask app entry
backend/api_routes.py API routes and upload/transcription flow
backend/db_queries.py Supabase DB/storage helpers
schema.sql database schema
frontend/ React app

Environment (`backend/.env`)

Required:

SUPABASE_URL
SUPABASE_SERVICE_ROLE_KEY (service role key, not publishable key)
SUPABASE_BUCKET=archives

Optional:

BACKEND_UPLOAD_DIR=uploads
WHISPER_MODEL=base
WHISPER_DEVICE=cpu
WHISPER_COMPUTE_TYPE=int8
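As a rough sketch of how these settings might be validated at startup: the helper below is hypothetical (it is not taken from backend/main.py), it assumes the variables are already present in the process environment (for example, loaded from backend/.env by a dotenv loader), and it treats the values shown for the optional variables as defaults, which is an assumption.

```python
import os
from typing import Mapping

REQUIRED = ("SUPABASE_URL", "SUPABASE_SERVICE_ROLE_KEY", "SUPABASE_BUCKET")

# Assumption: the values listed under "Optional" act as defaults when unset.
OPTIONAL_DEFAULTS = {
    "BACKEND_UPLOAD_DIR": "uploads",
    "WHISPER_MODEL": "base",
    "WHISPER_DEVICE": "cpu",
    "WHISPER_COMPUTE_TYPE": "int8",
}


def load_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Hypothetical helper: fail fast on missing required variables, fill optional ones."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError("Missing required settings: " + ", ".join(missing))
    settings = {name: env[name] for name in REQUIRED}
    for name, default in OPTIONAL_DEFAULTS.items():
        settings[name] = env.get(name) or default
    return settings
```

Failing early with the list of missing names tends to be easier to debug than a storage or database error later in the upload flow.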

Run Backend

cd backend
python main.py

Backend runs on http://localhost:5000.

Run Frontend

cd frontend
npm install
npm run dev

Set frontend API base to http://127.0.0.1:5000/api (or your backend host).

Core API Endpoints

Auth:

POST /api/auth/register
POST /api/auth/login

Upload + processing:

POST /api/posts/upload (multipart form-data: file, user_id, title, visibility, optional metadata)
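To make the multipart layout concrete, here is a minimal standard-library sketch that assembles a request body with the fields listed above. The helper name is hypothetical, and sending metadata as a JSON string is an assumption; check backend/api_routes.py for the format the route actually expects.

```python
import json
import uuid


def build_upload_body(file_name, file_bytes, user_id, title, visibility, metadata=None):
    """Hypothetical helper: return (content_type_header, body_bytes) for the upload."""
    boundary = uuid.uuid4().hex
    fields = {"user_id": str(user_id), "title": title, "visibility": visibility}
    if metadata is not None:
        fields["metadata"] = json.dumps(metadata)  # assumption: JSON-encoded text field

    parts = []
    for name, value in fields.items():
        parts.append(
            (
                f"--{boundary}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
                f"{value}\r\n"
            ).encode("utf-8")
        )
    # The media itself goes in the "file" part as raw bytes.
    parts.append(
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return f"multipart/form-data; boundary={boundary}", b"".join(parts)


# Usage (not executed here; requires a running backend):
# import urllib.request
# content_type, body = build_upload_body("talk.mp3", data, 1, "Talk", "private")
# req = urllib.request.Request(
#     "http://localhost:5000/api/posts/upload",
#     data=body, headers={"Content-Type": content_type}, method="POST",
# )
# urllib.request.urlopen(req)
```

In practice an HTTP client library would build this body for you; the sketch only shows what ends up on the wire.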

History + RAG:

GET /api/users/<user_id>/history
GET /api/rag/search?user_id=<id>&q=<text>
GET /api/rag/search?user_id=<id>&query_embedding=[...]
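The two search forms above differ only in their query string. The sketch below builds either URL with the standard library; the function name is hypothetical, and passing the embedding as a JSON array is an assumption based on the `[...]` placeholder.

```python
import json
from urllib.parse import urlencode

API_BASE = "http://127.0.0.1:5000/api"  # same base the frontend is pointed at


def rag_search_url(user_id, q=None, query_embedding=None, api_base=API_BASE):
    """Hypothetical helper: build a /rag/search URL for vector mode or text mode."""
    params = {"user_id": user_id}
    if query_embedding is not None:
        # Assumption: the embedding is sent as a compact JSON array.
        params["query_embedding"] = json.dumps(list(query_embedding), separators=(",", ":"))
    elif q:
        params["q"] = q
    else:
        raise ValueError("Provide q or query_embedding")
    return f"{api_base}/rag/search?{urlencode(params)}"
```

For example, `rag_search_url(7, q="family farm")` yields `http://127.0.0.1:5000/api/rag/search?user_id=7&q=family+farm`. Long embeddings can produce very long URLs, so the text form is the simpler one to try first.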

Playback:

GET /api/posts/<post_id>/audio-url?user_id=<id> (user_id is required for private posts)

Post data:

GET /api/posts
GET /api/posts/<post_id>
GET /api/posts/<post_id>/bundle
GET /api/posts/<post_id>/files
GET /api/posts/<post_id>/chunks

Notes

Original media is stored in Supabase Storage; DB stores the object path in archive_files (role=original_audio).
Transcript text, chunks, metadata, and audit records remain in Postgres tables.
If a storage upload fails with RLS (row-level security) errors, verify that SUPABASE_SERVICE_ROLE_KEY holds the service-role key and check the bucket policies.
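One quick way to check which key is configured, sketched under a stated assumption: the key is a JWT whose payload carries a `role` claim, as has been the case for Supabase's JWT-format anon and service-role keys. The helper below is hypothetical, does not verify the signature, and returns None for keys in any other format.

```python
import base64
import json


def jwt_role(key: str):
    """Return the unverified 'role' claim of a JWT-style key, or None if it is not a JWT."""
    parts = key.split(".")
    if len(parts) != 3:
        return None
    payload = parts[1]
    payload += "=" * (-len(payload) % 4)  # restore stripped base64 padding
    try:
        claims = json.loads(base64.urlsafe_b64decode(payload))
    except ValueError:  # covers bad base64, bad UTF-8, and bad JSON
        return None
    return claims.get("role") if isinstance(claims, dict) else None
```

If this returns `anon` for the value in SUPABASE_SERVICE_ROLE_KEY, the publishable key was probably pasted by mistake. Avoid printing the key itself when debugging.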
