feat: Implement core real-time interaction capabilities for Gemini API #32

Open
wants to merge 1 commit into
base: main

vizvasrj

This commit introduces the fundamental features for real-time communication with the Gemini API, encompassing:

  • Real-time audio streaming: Enables bidirectional audio interaction using microphone input and API responses.
  • Real-time video streaming (camera and screen capture): Allows streaming video from either a webcam or the screen, configurable via the MODE environment variable. Includes image resizing and encoding for efficient transmission.
  • Text-based prompt functionality: Implements the ability to send text prompts to the Gemini API via the command line.
  • Graceful shutdown mechanism: Ensures proper cleanup of audio and video resources, including closing streams and websocket connections, upon application exit.
  • Initial README documentation: Provides a comprehensive guide on project setup, configuration, and usage examples for various modes.
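
For reviewers who want a concrete picture of two of the points above (selecting the video source via `MODE`, and the graceful shutdown), here is a minimal sketch. It is not taken from this commit: the language, the accepted `MODE` values (`camera`, `screen`), the default, and the helper names `select_video_source` and `run_session` are all assumptions made for illustration.

```python
import os
from contextlib import ExitStack

# Assumed values; the PR description does not list the accepted MODE strings.
VALID_MODES = ("camera", "screen")


def select_video_source(env=None, default="camera"):
    """Return the video source named by MODE, falling back to an assumed default."""
    env = os.environ if env is None else env
    mode = env.get("MODE", default).strip().lower()
    if mode not in VALID_MODES:
        raise ValueError(f"unsupported MODE {mode!r}; expected one of {VALID_MODES}")
    return mode


def run_session(resources, work):
    """Run work(), then close every resource in reverse order, even if work() raises.

    `resources` is any iterable of objects with a close() method
    (hypothetical stand-ins for audio streams, video capture, and the websocket).
    """
    with ExitStack() as stack:
        for resource in resources:
            # Callbacks run last-in, first-out when the stack exits.
            stack.callback(resource.close)
        return work()
```

Registering the cleanup callbacks before any work starts means an exception or interrupt during streaming still closes the streams and the websocket, with the most recently opened resource closed first.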

@vizvasrj
Author

vizvasrj commented Jan 5, 2025

Is anyone going to review this?

@happy-qiao
Collaborator

Thank you for your contribution. This repository is not yet open for external contributions. We will update once we have established a process for accepting pull requests.
