Commit Graph

44 Commits

Author SHA1 Message Date
9974fc7a00 uncensored 版本的权重转化 2025-10-08 19:57:47 +08:00
753b3f9dc8 add use description 2025-10-08 19:57:03 +08:00
e8e7ca5eba codex examples 2025-10-08 11:12:00 +08:00
442c5cc7f0 更新用法 2025-10-08 11:11:46 +08:00
cb564f6233 update config 2025-09-02 22:53:28 +08:00
0c54e0ed7c usage note 2025-09-02 22:50:31 +08:00
7653ae304b Merge branch 'master' of https://github.com/hotwa/openharmony-mlx 2025-09-02 22:43:36 +08:00
7270797bd4 add host args 2025-09-02 22:40:15 +08:00
Zeng Lingyu
90933a673d Update pyproject.toml 2025-08-13 21:05:07 +08:00
Arthur Colle
92f5b57da3 Initial release: OpenHarmony-MLX - High-Performance Apple Silicon GPT-OSS Implementation
This is a complete rebranding and optimization of the original GPT-OSS codebase for Apple Silicon:

🚀 Features:
- Native MLX acceleration for M1/M2/M3/M4 chips
- Complete MLX implementation with Mixture of Experts (MoE)
- Memory-efficient quantization (4-bit MXFP4)
- Drop-in replacement APIs for existing backends
- Full tool integration (browser, python, apply_patch)
- Comprehensive build system with Metal kernels

📦 What's Included:
- gpt_oss/mlx_gpt_oss/ - Complete MLX implementation
- All original inference backends (torch, triton, metal, vllm)
- Command-line interfaces and Python APIs
- Developer tools and evaluation suite
- Updated branding and documentation

🍎 Apple Silicon Optimized:
- Up to 40 tokens/sec performance on Apple Silicon
- Run GPT-OSS-120b in 30GB with quantization
- Native Metal kernel acceleration
- Memory-mapped weight loading

🔧 Ready to Deploy:
- Updated package name to openharmony-mlx
- Comprehensive .gitignore for clean releases
- Updated README with Apple Silicon focus
- All build artifacts cleaned up

🧠 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
2025-08-06 19:28:25 -04:00
Scott Lessans
4931694686 fix build 2025-08-05 17:52:43 -07:00
Scott Lessans
f1774c5110 fix ci/pypi (#30) 2025-08-05 17:44:04 -07:00
3n0chK4n
d8db548846 Fix TOML parsing errors in pyproject.toml for scikit-build configuration (#27)
* Update wheel configuration in pyproject.toml to include package tree

* Added python dependency and shell globbing for metal command
2025-08-05 16:15:12 -07:00
draczer01
a6d9d90ab5 set plataform for CI porpuses (#18)
used a specific plataform accepted by pypl
2025-08-05 14:42:46 -07:00
Zhuohan Li
ba7d80ab89 Fix chat demo (#26) 2025-08-05 14:35:56 -07:00
mkusaka
3a68b4f565 fix: Correct multiple documentation URLs (#17)
- Fix OpenAI Cookbook NVIDIA article URL (remove incorrect gpt-oss/ prefix)
- Fix Groq Responses API documentation URL (responses → responses-api)
2025-08-05 14:29:10 -07:00
cwhitelam
0a8f5f29d3 Correct small grammar issues for better comprehension (#21)
* Correct small grammar issues for better comprehension

* Update README.md

---------

Co-authored-by: Christopher Whitelam <cwhitelam@northeastprecast.com>
Co-authored-by: Dominik Kundel <dkundel@openai.com>
2025-08-05 14:28:41 -07:00
Niles Burbank
246e377ba5 Add some additional links to awesome-gpt-oss.md (#22)
Add links to relevant AMD resources
2025-08-05 14:27:09 -07:00
Jack Clayton
8fe4ee2088 Fix import for metal example (#24)
It was previously pointing to an empty __init__.py.

Also remove unused date import.
2025-08-05 14:24:18 -07:00
Mihajlo Micic
89fe402d10 Add comprehensive test suite for Responses API (#20)
The project had almost no test coverage - just a single test checking if the API returns 200. This adds proper testing infrastructure and 21 new tests covering the main API functionality.

Tests now cover response creation, error handling, tools, sessions, performance, and usage tracking. All tests passing.
2025-08-05 12:50:06 -07:00
Dominik Kundel
9074326499 Update LICENSE 2025-08-05 11:44:29 -07:00
Ricky Saull
51bfa9ed24 Fix typos and grammar in README (#6)
Corrected several typos and updated all references from 'Pytorch' to 'PyTorch' for consistency. Improved clarity in model descriptions and usage instructions throughout the README.
2025-08-05 11:29:13 -07:00
Mohammad Miadh Angkad
1e47b70436 docs: Fix another extra "= messages" (#7)
Fix another extra "= messages" in README
2025-08-05 11:28:29 -07:00
Zhuohan Li
3e3c828598 Merge pull request #16 from openai/zhuohan/fix-pypi-ci
Try fix pypi ci
2025-08-05 11:27:46 -07:00
Li Yang
9e5b841987 readme: fix python tool ref (#10) 2025-08-05 11:27:31 -07:00
Zhuohan Li
3b035f3fa9 build triton from scratch 2025-08-05 11:27:09 -07:00
Sumit Aryal
08e50b3243 Python Agents SDK Example (#14) 2025-08-05 11:26:02 -07:00
mkusaka
f615ce39b3 fix: Correct broken links in awesome-gpt-oss.md (#12)
- Fix HTTP to HTTPS for Hugging Face blog link
- Fix Groq blog link: HTTP to HTTPS, add /blog/ path, fix typo (open-model → open-models)
- Fix TensorRT-LLM documentation filename (blog_9 to blog9)
2025-08-05 11:24:13 -07:00
Zhuohan Li
0f0336796c Try fix pypi ci (#13) 2025-08-05 11:23:56 -07:00
Romain Huet
04257d5a8f Update README 2025-08-05 11:22:04 -07:00
Zhuohan Li
3f71c48945 Try fix pypi ci 2025-08-05 11:20:00 -07:00
Zhuohan Li
b5cc884079 Update harmony link (#8)
Update harmony link
2025-08-05 10:45:08 -07:00
Zhuohan Li
dcb122a024 Update harmony link 2025-08-05 10:44:43 -07:00
Simon Mo
c8f0d136cc feat: add a reference mcp servers (#5)
Signed-off-by: simon-mo <simon.mo@hey.com>
2025-08-05 10:41:01 -07:00
Ahmet Dedeler
71a363c21d docs: Fix extra "= messages" (#3) 2025-08-05 10:24:58 -07:00
Zhuohan Li
7ab2f7c4c3 Add instruction to download metal weights
Add instruction to download metal weights
2025-08-05 10:22:26 -07:00
Zhuohan Li
3e6491a306 Add instruction to download metal weights 2025-08-05 10:21:53 -07:00
Nate Chan
3832a82d8b first fix — readme (#2) 2025-08-05 10:16:15 -07:00
Lysandre Debut
a601a63cdc Transformers responses API (#1) 2025-08-05 10:02:16 -07:00
Dominik Kundel
0106ce5ba3 Correct version 2025-08-05 09:33:17 -07:00
Dominik Kundel
481c8eea95 add CI 2025-08-05 09:31:47 -07:00
Romain Huet
fa610607d1 Update vLLM command in README 2025-08-05 09:17:53 -07:00
Zhuohan Li
30e7277352 Add USAGE_POLICY 2025-08-05 08:55:22 -07:00
Dominik Kundel
243a1b0276 Initial commit
Co-authored-by: Zhuohan Li <zhuohan@openai.com>
Co-authored-by: Maratyszcza <marat@openai.com>
Co-authored-by: Volodymyr Kyrylov <vol@wilab.org.ua>
2025-08-05 08:19:49 -07:00