metadata
license: apache-2.0
title: teste-teste-teste
sdk: gradio
emoji: πŸƒ
colorFrom: red
colorTo: red
short_description: yadayadayada
sdk_version: 5.33.0

Real-Time Screen Assistant - Premium Edition

This Gradio application is a real-time screen assistant that combines Google's Gemini 2.0 Live API with screen recording and two-way audio.

Features

  • 🎙️ Real-time Audio Streaming - Voice activity detection with noise filtering
  • 🖥️ Professional Screen Recording - Native ScreenRecorder component with webcam overlay
  • 🤖 AI Voice Responses - Bidirectional audio communication with Gemini 2.0
  • 📝 Text Response Display - Real-time text responses with conversation history
  • 🔄 Background Task Management - Proper async handling and cleanup
  • 📊 Performance Monitoring - Real-time stats and adaptive quality
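
The README does not say how voice activity detection is implemented in app.py. As a rough illustration of the idea only, the sketch below classifies a frame of 16-bit PCM audio as speech when its RMS energy exceeds a threshold, which also acts as a crude noise gate. The function names and the threshold value are assumptions made for this example, not taken from the project.

```python
import math
import struct


def frame_rms(pcm_bytes):
    """Root-mean-square amplitude of a frame of 16-bit little-endian PCM."""
    count = len(pcm_bytes) // 2
    if count == 0:
        return 0.0
    samples = struct.unpack("<%dh" % count, pcm_bytes[: count * 2])
    return math.sqrt(sum(s * s for s in samples) / count)


def is_speech(pcm_bytes, threshold=500.0):
    """Treat a frame as speech when its RMS energy exceeds the threshold.

    The default threshold is an illustrative value (an assumption),
    not one read from app.py.
    """
    return frame_rms(pcm_bytes) > threshold
```

In practice the threshold would likely need tuning per microphone, and a real detector would usually smooth the decision over several consecutive frames.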

Setup

  1. Set your Google AI API key:

    export GEMINI_API_KEY="your-api-key-here"
    
  2. Install dependencies (Hugging Face Spaces does this automatically):

    pip install -r requirements.txt
    
  3. Run the application:

    python app.py
    

Components

  • app.py - Main application with premium real-time integration
  • gradio_screenrecorder/ - Custom Gradio component for screen recording
  • requirements.txt - All necessary dependencies including custom components

Environment Variables

  • GEMINI_API_KEY - Required: Your Google AI API key for Gemini 2.0 Live API
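
Because the key is required, it can help to fail early with a clear message when it is missing. The following is a minimal sketch of such a check; `load_api_key` is a hypothetical helper name, and whether app.py validates the variable this way is an assumption.

```python
import os


def load_api_key(env=None):
    """Return GEMINI_API_KEY from the environment, or raise a clear error.

    `env` defaults to os.environ; passing a dict makes the check easy to test.
    """
    env = os.environ if env is None else env
    key = env.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GEMINI_API_KEY is not set; export it before running app.py"
        )
    return key
```

Calling `load_api_key()` once at startup surfaces a missing key before any streaming session begins.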

Real-time Integration

This application implements complete real-time frontend integration:

  1. Continuous Audio Flow (User → Model) - Voice activity detection
  2. Model Audio Output (Model → User) - AI voice responses
  3. Screen Recording Integration - Professional screen capture
  4. Text Response Delivery (System → User) - Real-time text display

All features are tuned for 300-second real-time sessions, using adaptive quality and throttling.
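
The README does not describe the throttling logic itself. One simple way to combine a 300-second session limit with frame throttling is sketched below: a frame is sent only if the session is still within its time limit and a minimum interval has passed since the previous frame. The class name `FrameThrottle` and the one-second default interval are assumptions for illustration, not the project's actual implementation.

```python
class FrameThrottle:
    """Allow at most one frame per `min_interval` seconds within a session limit."""

    def __init__(self, min_interval=1.0, session_limit=300.0):
        self.min_interval = min_interval
        self.session_limit = session_limit
        self.started_at = None
        self.last_sent_at = None

    def should_send(self, now):
        """Return True if a frame captured at time `now` (seconds) should be sent."""
        if self.started_at is None:
            self.started_at = now  # the first call starts the session clock
        if now - self.started_at >= self.session_limit:
            return False  # session limit reached
        if (
            self.last_sent_at is not None
            and now - self.last_sent_at < self.min_interval
        ):
            return False  # too soon after the previous frame
        self.last_sent_at = now
        return True
```

A caller would typically pass `time.monotonic()` as `now`; taking the timestamp as an argument keeps the class deterministic and easy to test.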