openharmony-mlx

Author	SHA1	Message	Date
Arthur Colle	92f5b57da3	Initial release: OpenHarmony-MLX - High-Performance Apple Silicon GPT-OSS Implementation This is a complete rebranding and optimization of the original GPT-OSS codebase for Apple Silicon: 🚀 Features: - Native MLX acceleration for M1/M2/M3/M4 chips - Complete MLX implementation with Mixture of Experts (MoE) - Memory-efficient quantization (4-bit MXFP4) - Drop-in replacement APIs for existing backends - Full tool integration (browser, python, apply_patch) - Comprehensive build system with Metal kernels 📦 What's Included: - gpt_oss/mlx_gpt_oss/ - Complete MLX implementation - All original inference backends (torch, triton, metal, vllm) - Command-line interfaces and Python APIs - Developer tools and evaluation suite - Updated branding and documentation 🍎 Apple Silicon Optimized: - Up to 40 tokens/sec performance on Apple Silicon - Run GPT-OSS-120b in 30GB with quantization - Native Metal kernel acceleration - Memory-mapped weight loading 🔧 Ready to Deploy: - Updated package name to openharmony-mlx - Comprehensive .gitignore for clean releases - Updated README with Apple Silicon focus - All build artifacts cleaned up 🧠 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-06 19:28:25 -04:00
Scott Lessans	f1774c5110	fix ci/pypi (#30 )	2025-08-05 17:44:04 -07:00
3n0chK4n	d8db548846	Fix TOML parsing errors in pyproject.toml for scikit-build configuration (#27 ) * Update wheel configuration in pyproject.toml to include package tree * Added python dependency and shell globbing for metal command	2025-08-05 16:15:12 -07:00
cwhitelam	0a8f5f29d3	Correct small grammar issues for better comprehension (#21 ) * Correct small grammar issues for better comprehension * Update README.md --------- Co-authored-by: Christopher Whitelam <cwhitelam@northeastprecast.com> Co-authored-by: Dominik Kundel <dkundel@openai.com>	2025-08-05 14:28:41 -07:00
Ricky Saull	51bfa9ed24	Fix typos and grammar in README (#6 ) Corrected several typos and updated all references from 'Pytorch' to 'PyTorch' for consistency. Improved clarity in model descriptions and usage instructions throughout the README.	2025-08-05 11:29:13 -07:00
Mohammad Miadh Angkad	1e47b70436	docs: Fix another extra "= messages" (#7 ) Fix another extra "= messages" in README	2025-08-05 11:28:29 -07:00
Zhuohan Li	3e3c828598	Merge pull request #16 from openai/zhuohan/fix-pypi-ci Try fix pypi ci	2025-08-05 11:27:46 -07:00
Li Yang	9e5b841987	readme: fix python tool ref (#10 )	2025-08-05 11:27:31 -07:00
Zhuohan Li	3b035f3fa9	build triton from scratch	2025-08-05 11:27:09 -07:00
Romain Huet	04257d5a8f	Update README	2025-08-05 11:22:04 -07:00
Zhuohan Li	dcb122a024	Update harmony link	2025-08-05 10:44:43 -07:00
Ahmet Dedeler	71a363c21d	docs: Fix extra "= messages" (#3 )	2025-08-05 10:24:58 -07:00
Zhuohan Li	3e6491a306	Add instruction to download metal weights	2025-08-05 10:21:53 -07:00
Nate Chan	3832a82d8b	first fix — readme (#2 )	2025-08-05 10:16:15 -07:00
Lysandre Debut	a601a63cdc	Transformers responses API (#1 )	2025-08-05 10:02:16 -07:00
Romain Huet	fa610607d1	Update vLLM command in README	2025-08-05 09:17:53 -07:00
Dominik Kundel	243a1b0276	Initial commit Co-authored-by: Zhuohan Li <zhuohan@openai.com> Co-authored-by: Maratyszcza <marat@openai.com> Co-authored-by: Volodymyr Kyrylov <vol@wilab.org.ua>	2025-08-05 08:19:49 -07:00

17 Commits