You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
01/TASKS.md

41 lines
2.6 KiB

This file contains invisible Unicode characters!

This file contains invisible Unicode characters that may be processed differently from what appears below. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to reveal hidden characters.

**OI**
1. Finish skill library.
1. Create a system message that includes instructions on how to use the skill library.
2. Test it end-to-end.
3. Create computer.skills.teach(), which displays a message asking users to complete the task via voice in the most generalizable way possible. OI should use computer.mouse and computer.keyboard to fulfill each step, then save the generalized instruction as a skill. Clicking the mouse cancels teach mode. When OI invokes this skill in the future, it will just list those steps (it needs to figure out how to flexibly accomplish each step).
4. Expose ^ via `interpreter --teach`.
2. Add `interpreter --server --expose`.
1. Include the 01's --server in the next OI update.
2. Add --server --expose which will expose the server via something like Ngrok, display the public URL and a password, so the 01 Light can connect to it. This will let people use OI on their computer via their Light — i.e. "Check my emails" will run Applescript on their home computer.
3. Why is OI starting so slowly? We could use time.time() around things to track it down.
4. Create moondream-powered computer.camera.
1. Computer.camera.view(query) should take a picture and ask moondream the query. Defaults to "Describe this image in detail."
**OS**
1. Swap out the current hosted functions for local ones.
1. TTS — Piper? OpenVoice? Rasspy?
2. STT — Whisper? Canary?
3. LLM — Phi-2 Q4 Llamafile, just need to download it, OI knows how to use Llamafiles
2. If Light and no internet, open a captive wifi page with text boxes: Wifi Name, Wifi Pass, (optional) Server URL, (optional) Server Pass
3. Can tapping the mic twice = pressing the "button"? Simple sensing, just based on volume spikes?
4. Add basic TUI to device.py. Just renders messages and lets you add messages. Can easily copy OI's TUI.
5. Replace bootloader and boot script— should just run 01, full screen TUI.
6. Package it as an ISO, or figure out some other simple install instructions. How to easily install on a Pi?
**Hardware**
1. (Hardware and software) Get the 01OS working on the Jetson or Pi. Pick one to move forward with.
2. Connect the Seeed Sense (ESP32 with Wifi, Bluetooth and a mic) to a small DAC + amplifier + speaker.
3. Connect the Seeed Sense to a battery.
4. Configure the ESP32 to be a wireless mic + speaker for the Jetson or Pi.
5. Connect the Jetson or Pi to a battery.
6. Make a rudimentary case for the Seeed Sense + speaker. Optional.
7. Make a rudimentary case for the Jetson or Pi. Optional.
Misc.
1. Develop default system message for executive assistant.
2. Determine recommended minimal hardware for the light & heavy.