86 Commits (main)
 

Author SHA1 Message Date
Artem-Darius Weber d57debe0d4 feat: compleate docker compose for windows machine
2 weeks ago
Artem-Darius Weber 3510af1fbd added Docker setup
2 weeks ago
thinhlpg 20fb6779c3 docs: add GGUF
3 weeks ago
thinhlpg 7cd4d18ee6 feat: enhance model loading with fallback to Hugging Face repo and improved error handling
3 weeks ago
thinhlpg bac5f3b4f7 feat: update config and paths, update data genenration script
3 weeks ago
thinhlpg bd1d7ced3b docs: update project description and authors in pyproject.toml; reorganize demo section in README
3 weeks ago
thinhlpg 647f7781d5 fix: change data to correct format
3 weeks ago
thinhlpg 62cc8137bf docs: add Experiments section to README with detailed run information
4 weeks ago
thinhlpg c153652856 docs: update README with new image
4 weeks ago
thinhlpg ad18169d77 docs: enhance README with demo GIF
4 weeks ago
thinhlpg 89e07bc02d chore: chore: remove unused code and dependencies
4 weeks ago
thinhlpg 5eabd121a3 docs: update README
4 weeks ago
thinhlpg 0b4bf54833 feat: update demo from DeepSearch to ReZero, adjusting related logging and UI components
4 weeks ago
thinhlpg 9738b80353 feat: update max generations and output length in evaluation scripts, add memory fraction to server launch
4 weeks ago
thinhlpg 7ee65269fb feat: add new evaluation notebook for model testing and checkpoint evaluation
4 weeks ago
thinhlpg bec864038b feat: increase max tokens and new tokens in evaluation scripts
4 weeks ago
thinhlpg dfa420fa49 feat: expand Makefile with serving and evaluation commands
4 weeks ago
thinhlpg 6ba963aca3 feat: streamline data preparation in Makefile with a single command
4 weeks ago
thinhlpg 424459d840 feat: update evaluation scripts to enhance model configuration and dataset loading, including increased max tokens and added logging
4 weeks ago
thinhlpg bf9f2c4102 docs: update README with setup instructions, quick demo, and data preparation steps for better clarity and usability
4 weeks ago
thinhlpg d7cdb6c917 chore: remove unused scripts
4 weeks ago
thinhlpg 1e7514f98e chore: remove outdated documentation files to clean up project structure
4 weeks ago
thinhlpg 333d1e596e feat: add prepare-dev-data target and script for Musique dev data transformation
4 weeks ago
thinhlpg 504f0c6c8e feat: update reward_em_chunk to match only the LAST required paragraph of the reasoning chain and adjust related tests
4 weeks ago
thinhlpg 358875a035 feat: enhance reward_em_chunk function to match multiple paragraphs, add test
4 weeks ago
thinhlpg 2df9f39fda feat: update model configuration (longer context) and dataset loading logic for improved performance and flexibility
4 weeks ago
thinhlpg 4a1d45271d feat: add scripts for musique data processing
4 weeks ago
thinhlpg 74aa673866 chores: add cook notebook for musique and model reasoning pattern
4 weeks ago
thinhlpg 14ef79a4f5 feat: [WIP] add bench scripts
4 weeks ago
thinhlpg bd02305efb chores: add cook notebooks
1 month ago
thinhlpg d8e949ec7c feat: add Tavily search tab and integrate TavilyClient for web search functionality
1 month ago
thinhlpg 41b7889a30 feat: integrate QA dataset loading and display gold answers in Gradio interface
1 month ago
thinhlpg 7376f596a5 feat: add Gradio demo for DeepSearch and update configuration settings
1 month ago
thinhlpg 7ff3623102 chore: update .gitignore, modify Makefile for installation, and add pyproject.toml for project configuration
1 month ago
thinhlpg eebf914a81 refactor: moved modules from src/deepsearch to src/
1 month ago
thinhlpg 0f662d4330 refactor: moved FlashRAG submodule from src/ to third_party/
1 month ago
thinhlpg 55f34b8503 feat: add FlashRAG as submodule
1 month ago
thinhlpg 2fec4f2f42 refactor: change repo stucture (move code from src/ to src/deepsearch)
1 month ago
thinhlpg e3163081a0 docs: add experiment log for llama-3.2-3b-instruct experiments
1 month ago
thinhlpg 010957cd99 feat: disable randomization option to get_qa_dataset function by default
1 month ago
automaticcat 56911a73f9 Update README.md
1 month ago
thinhlpg 1a18cd7bfd feat: update training and evaluation configurations (editable agent generation scripts)
1 month ago
thinhlpg 77f121662f test: add tests for reward_retry function scenarios
1 month ago
thinhlpg c8714e0f6b feat: enhance reward_retry function to handle missing answer tags
1 month ago
thinhlpg bf480574a2 fix: minor bug
1 month ago
thinhlpg 3081d6e36b test: added tests for new reward functions: search strategy and search diversity
1 month ago
thinhlpg 4de31e0f30 feat: expand reward functions with new strategies and diversity checks
1 month ago
thinhlpg d0e6068055 fix: strengthen reward correctness logic to handle final message is not asnwer form assistant. Also update logs for reward functions for better debug
1 month ago
thinhlpg 1bd609dfae test: enhance reward correctness tests with validation logic
1 month ago
thinhlpg 338655e563 feat: refine user prompt logic for improved clarity and structure
1 month ago