You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
01/README.md

169 lines
6.4 KiB

11 months ago
<h1 align="center"></h1>
<p align="center">
10 months ago
<a href="https://discord.gg/Hvz9Axh84z"><img alt="Discord" src="https://img.shields.io/discord/1146610656779440188?logo=discord&style=social&logoColor=black"/></a>
11 months ago
<br>
<br>
5 months ago
<strong>The #1 open-source voice interface.</strong><br>
<br><a href="https://changes.openinterpreter.com">Get Updates</a> | <a href="https://01.openinterpreter.com/">Documentation</a><br>
11 months ago
</p>
10 months ago
<div align="center">
5 months ago
[中文版](docs/README_CN.md) | [日本語](docs/README_JA.md) | [English](README.md)
10 months ago
</div>
11 months ago
<br>
10 months ago
![OI-O1-BannerDemo-2](https://www.openinterpreter.com/OI-O1-BannerDemo-3.jpg)
11 months ago
10 months ago
We want to help you build. [Apply for 1-on-1 support.](https://0ggfznkwh4j.typeform.com/to/kkStE8WF)
11 months ago
<br>
10 months ago
> [!IMPORTANT]
10 months ago
> This experimental project is under rapid development and lacks basic safeguards. Until a stable `1.0` release, only run this repository on devices without sensitive information or access to paid services.
10 months ago
<br>
5 months ago
The **01** is an open-source platform for conversational devices, inspired by the *Rabbit R1* and *Star Trek* computer.
10 months ago
5 months ago
By centering this project on [Open Interpreter](https://github.com/OpenInterpreter/open-interpreter), the **01** is more natural, flexible, and capable than its predecessors. Assistants built from this repository can:
5 months ago
- Execute code
- Browse the web
- Read and create files
- Control third-party software
- ...
<br>
10 months ago
5 months ago
We intend to become the GNU/Linux of this new space by staying open, modular, and free.
10 months ago
<br>
# Software
11 months ago
```shell
5 months ago
git clone https://github.com/OpenInterpreter/01
cd 01/software
11 months ago
```
5 months ago
> Not working? Read the [setup docs](https://01.openinterpreter.com/software/introduction).
11 months ago
```shell
brew install ffmpeg portaudio poetry # mac only. windows and linux instructions below
5 months ago
poetry install
poetry run 01
11 months ago
```
<!-- > For a Windows installation, read our [setup guide](https://docs.openinterpreter.com/getting-started/setup#windows). -->
<br>
5 months ago
**Note:** The [RealtimeSTT](https://github.com/KoljaB/RealtimeSTT) and [RealtimeTTS](https://github.com/KoljaB/RealtimeTTS) libraries at the heart of the 01 are the work of [Kolja Beigel](https://github.com/KoljaB). Please star those repositories and consider contributing to those projects!
10 months ago
# Hardware
11 months ago
5 months ago
The **01** is also a hub for hardware devices that run or connect to our software.
10 months ago
5 months ago
- Mac, Windows, and Linux are supported by running `poetry run 01`. This starts the [01 server](https://01.openinterpreter.com/software/run) and a client that uses your `ctrl` key to simulate the 01 light.
5 months ago
- We have an Android and iOS application under development [here](software/source/clients/mobile).
5 months ago
- The 01 light is an ESP32-based, push-to-talk voice interface. Build documentation is [here.](https://01.openinterpreter.com/hardware/01-light/materials)
- It works by connecting to the [01 server](https://01.openinterpreter.com/software/run).
<br>
**We need your help supporting & building more hardware.** The 01 should be able to run on any device with input (microphone, keyboard, etc.), output (speakers, screens, motors, etc.), and an internet connection (or sufficient compute to run everything locally). [Contribution Guide ↗️](https://github.com/OpenInterpreter/01/blob/main/CONTRIBUTING.md)
10 months ago
10 months ago
<br>
10 months ago
# What does it do?
10 months ago
5 months ago
The 01 exposes a speech-to-speech websocket at `localhost:10101`.
10 months ago
10 months ago
If you stream raw audio bytes to `/` in [Streaming LMC format](https://docs.openinterpreter.com/guides/streaming-response), you will receive its response in the same format.
10 months ago
Inspired in part by [Andrej Karpathy's LLM OS](https://twitter.com/karpathy/status/1723140519554105733), we run a [code-interpreting language model](https://github.com/OpenInterpreter/open-interpreter), and call it when certain events occur at your computer's [kernel](https://github.com/OpenInterpreter/01/blob/main/software/source/server/utils/kernel.py).
10 months ago
The 01 wraps this in a voice interface:
10 months ago
10 months ago
<br>
<img width="100%" alt="LMC" src="https://github.com/OpenInterpreter/01/assets/63927363/52417006-a2ca-4379-b309-ffee3509f5d4"><br><br>
10 months ago
# Protocols
11 months ago
10 months ago
## LMC Messages
10 months ago
To communicate with different components of this system, we introduce [LMC Messages](https://docs.openinterpreter.com/protocols/lmc-messages) format, which extends OpenAIs messages format to include a "computer" role:
https://github.com/OpenInterpreter/01/assets/63927363/8621b075-e052-46ba-8d2e-d64b9f2a5da9
10 months ago
## Dynamic System Messages
10 months ago
Dynamic System Messages enable you to execute code inside the LLM's system message, moments before it appears to the AI.
10 months ago
```python
# Edit the following settings in Profiles
10 months ago
interpreter.system_message = r" The time is {{time.time()}}. " # Anything in double brackets will be executed as Python
10 months ago
interpreter.chat("What time is it?") # It will know, without making a tool/API call
11 months ago
```
10 months ago
# Guides
## 01 Server
To run the server on your Desktop and connect it to your 01 Light, run the following commands:
```shell
brew install ngrok/ngrok/ngrok
ngrok authtoken ... # Use your ngrok authtoken
5 months ago
poetry run 01 --server light --expose
10 months ago
```
The final command will print a server URL. You can enter this into your 01 Light's captive WiFi portal to connect to your 01 Server.
10 months ago
## Local Mode
11 months ago
```
5 months ago
poetry run 01 --profile local.py
11 months ago
```
10 months ago
## Customizations
To customize the behavior of the system, edit the [system message, model, skills library path,](https://docs.openinterpreter.com/settings/all-settings) etc. in the `profiles` directory under the `server` directory. This file sets up an interpreter, and is powered by Open Interpreter.
To specify the text-to-speech service for the 01 `base_device.py`, set `interpreter.tts` to either "openai" for OpenAI, "elevenlabs" for ElevenLabs, or "coqui" for Coqui (local) in a profile. For the 01 Light, set `SPEAKER_SAMPLE_RATE` in `client.ino` under the `esp32` client directory to 24000 for Coqui (local) or 22050 for OpenAI TTS. We currently don't support ElevenLabs TTS on the 01 Light.
10 months ago
10 months ago
## Ubuntu Dependencies
10 months ago
```bash
5 months ago
sudo apt-get install ffmpeg
12 months ago
```
# Contributors
10 months ago
[![01 project contributors](https://contrib.rocks/image?repo=OpenInterpreter/01&max=2000)](https://github.com/OpenInterpreter/01/graphs/contributors)
Please see our [contributing guidelines](CONTRIBUTING.md) for more details on how to get involved.
11 months ago
<br>
5 months ago
## Directory
5 months ago
### [Context ↗](https://github.com/KillianLucas/01/blob/main/CONTEXT.md)
5 months ago
The story that came before the 01.
5 months ago
### [Roadmap ↗](/ROADMAP.md)
1 year ago
5 months ago
The future of the 01.
<br>
10 months ago