Commit Graph

17 Commits

Author SHA1 Message Date
Arthur Colle
92f5b57da3 Initial release: OpenHarmony-MLX - High-Performance Apple Silicon GPT-OSS Implementation
This is a complete rebranding and optimization of the original GPT-OSS codebase for Apple Silicon:

🚀 Features:
- Native MLX acceleration for M1/M2/M3/M4 chips
- Complete MLX implementation with Mixture of Experts (MoE)
- Memory-efficient quantization (4-bit MXFP4)
- Drop-in replacement APIs for existing backends
- Full tool integration (browser, python, apply_patch)
- Comprehensive build system with Metal kernels

📦 What's Included:
- gpt_oss/mlx_gpt_oss/ - Complete MLX implementation
- All original inference backends (torch, triton, metal, vllm)
- Command-line interfaces and Python APIs
- Developer tools and evaluation suite
- Updated branding and documentation

🍎 Apple Silicon Optimized:
- Up to 40 tokens/sec performance on Apple Silicon
- Run GPT-OSS-120b in 30GB with quantization
- Native Metal kernel acceleration
- Memory-mapped weight loading

🔧 Ready to Deploy:
- Updated package name to openharmony-mlx
- Comprehensive .gitignore for clean releases
- Updated README with Apple Silicon focus
- All build artifacts cleaned up

🧠 Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
2025-08-06 19:28:25 -04:00
Scott Lessans
f1774c5110 fix ci/pypi (#30) 2025-08-05 17:44:04 -07:00
3n0chK4n
d8db548846 Fix TOML parsing errors in pyproject.toml for scikit-build configuration (#27)
* Update wheel configuration in pyproject.toml to include package tree

* Added python dependency and shell globbing for metal command
2025-08-05 16:15:12 -07:00
cwhitelam
0a8f5f29d3 Correct small grammar issues for better comprehension (#21)
* Correct small grammar issues for better comprehension

* Update README.md

---------

Co-authored-by: Christopher Whitelam <cwhitelam@northeastprecast.com>
Co-authored-by: Dominik Kundel <dkundel@openai.com>
2025-08-05 14:28:41 -07:00
Ricky Saull
51bfa9ed24 Fix typos and grammar in README (#6)
Corrected several typos and updated all references from 'Pytorch' to 'PyTorch' for consistency. Improved clarity in model descriptions and usage instructions throughout the README.
2025-08-05 11:29:13 -07:00
Mohammad Miadh Angkad
1e47b70436 docs: Fix another extra "= messages" (#7)
Fix another extra "= messages" in README
2025-08-05 11:28:29 -07:00
Zhuohan Li
3e3c828598 Merge pull request #16 from openai/zhuohan/fix-pypi-ci
Try fix pypi ci
2025-08-05 11:27:46 -07:00
Li Yang
9e5b841987 readme: fix python tool ref (#10) 2025-08-05 11:27:31 -07:00
Zhuohan Li
3b035f3fa9 build triton from scratch 2025-08-05 11:27:09 -07:00
Romain Huet
04257d5a8f Update README 2025-08-05 11:22:04 -07:00
Zhuohan Li
dcb122a024 Update harmony link 2025-08-05 10:44:43 -07:00
Ahmet Dedeler
71a363c21d docs: Fix extra "= messages" (#3) 2025-08-05 10:24:58 -07:00
Zhuohan Li
3e6491a306 Add instruction to download metal weights 2025-08-05 10:21:53 -07:00
Nate Chan
3832a82d8b first fix — readme (#2) 2025-08-05 10:16:15 -07:00
Lysandre Debut
a601a63cdc Transformers responses API (#1) 2025-08-05 10:02:16 -07:00
Romain Huet
fa610607d1 Update vLLM command in README 2025-08-05 09:17:53 -07:00
Dominik Kundel
243a1b0276 Initial commit
Co-authored-by: Zhuohan Li <zhuohan@openai.com>
Co-authored-by: Maratyszcza <marat@openai.com>
Co-authored-by: Volodymyr Kyrylov <vol@wilab.org.ua>
2025-08-05 08:19:49 -07:00