Skip to content

Latest commit

 

History

History
52 lines (33 loc) · 3.45 KB

README.md

File metadata and controls

52 lines (33 loc) · 3.45 KB

VoiceWhiz

VoiceWhiz, your advanced ChatGPT virtual assistant, empowers you with voice-controlled interactions and intelligent responses. Harnessing OpenAI's cutting-edge language model, SpeakEZ assists you in a myriad of tasks, from answering questions and providing information to generating suggestions, telling jokes, and even crafting stories. It seamlessly transitions between spoken and written responses, adapting to your device's display capabilities.

Project Structure

The project is divided into four main python files:

  1. main.py: The main script that integrates all other modules, listens for the wake word, manages the audio recording, and handles the interaction with OpenAI and AWS Polly.

  2. stt_service.py: Handles the speech-to-text conversion

  3. tts_service.py: Handles the text-to-speech conversion using Amazon Polly.

  4. input_listener.py: Handles the audio recording and silence detection.

  5. chat_gpt_service.py: Manages the interaction with OpenAI's GPT-3 model.

There is also a configuration file, config.json, which stores important parameters and keys.

Features

  • Voice input: Users can speak to the virtual assistant, which converts their speech to text for processing.
  • Text input: Users can also type text directly into the application to interact with the virtual assistant.
  • Natural language understanding: The application utilizes the OpenAI GPT-3 model to understand and generate human-like responses.
  • Text-to-speech conversion: Responses from the virtual assistant are converted to audio for a more natural conversational experience.
  • Multithreading: The application uses multithreading to handle audio recording and processing simultaneously, ensuring a smooth user experience.

Key Features

  • Voice Interaction: Engage in natural conversations with SpeakEZ, whether you're asking questions or requesting assistance.
  • Wake Word Detection: Utilize Picovoice Porcupine to trigger SpeakEZ with a custom wake word.
  • Voice Activity Detection: Picovoice Cobra ensures SpeakEZ listens when you speak and stops when you're done.
  • Speech-to-Text Conversion: Picovoice Leopard accurately translates your spoken queries into text.
  • ChatGPT Response: Receive intelligent and context-aware responses generated by OpenAI's ChatGPT.
  • Text-to-Speech Output: Amazon Polly transforms responses into a natural-sounding human voice.
  • API Compatibility: SpeakEZ uses the latest ChatGPT API, with plans for a seamless transition to GPT-4 upon its widespread availability.

Getting Started on Raspberry Pi 4:

  1. Hardware Setup: Gather the necessary hardware for your Raspberry Pi 4.
  2. Create Accounts:
    • OpenAI: Create an OpenAI account and obtain your personal secret API key.
    • Picovoice: Create a Picovoice account and get your secret access key.
    • AWS: Create an AWS account and obtain your personal access key and secret access key.
  3. Follow Instructions: Execute the steps below to prepare your Raspberry Pi and download SpeakEZ - The ChatGPT Virtual Assistant.

Summary

Leverage the capabilities of Raspberry Pi and voice recognition technology to establish a seamless and natural interaction with ChatGPT. This hands-free methodology not only elevates user experience but also unlocks innovative opportunities for voice-based AI applications across diverse domains. Get your Raspberry Pi, explore your creative potential, and experience ChatGPT responding to your voice commands in unprecedented ways.