openharmony-mlx

Author	SHA1	Message	Date
hotwa	9974fc7a00	uncensored 版本的权重转化	2025-10-08 19:57:47 +08:00
hotwa	7270797bd4	add host args	2025-09-02 22:40:15 +08:00
Arthur Colle	92f5b57da3	Initial release: OpenHarmony-MLX - High-Performance Apple Silicon GPT-OSS Implementation This is a complete rebranding and optimization of the original GPT-OSS codebase for Apple Silicon: 🚀 Features: - Native MLX acceleration for M1/M2/M3/M4 chips - Complete MLX implementation with Mixture of Experts (MoE) - Memory-efficient quantization (4-bit MXFP4) - Drop-in replacement APIs for existing backends - Full tool integration (browser, python, apply_patch) - Comprehensive build system with Metal kernels 📦 What's Included: - gpt_oss/mlx_gpt_oss/ - Complete MLX implementation - All original inference backends (torch, triton, metal, vllm) - Command-line interfaces and Python APIs - Developer tools and evaluation suite - Updated branding and documentation 🍎 Apple Silicon Optimized: - Up to 40 tokens/sec performance on Apple Silicon - Run GPT-OSS-120b in 30GB with quantization - Native Metal kernel acceleration - Memory-mapped weight loading 🔧 Ready to Deploy: - Updated package name to openharmony-mlx - Comprehensive .gitignore for clean releases - Updated README with Apple Silicon focus - All build artifacts cleaned up 🧠 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-06 19:28:25 -04:00
Zhuohan Li	ba7d80ab89	Fix chat demo (#26 )	2025-08-05 14:35:56 -07:00
Jack Clayton	8fe4ee2088	Fix import for metal example (#24 ) It was previously pointing to an empty __init__.py. Also remove unused date import.	2025-08-05 14:24:18 -07:00
Lysandre Debut	a601a63cdc	Transformers responses API (#1 )	2025-08-05 10:02:16 -07:00
Dominik Kundel	243a1b0276	Initial commit Co-authored-by: Zhuohan Li <zhuohan@openai.com> Co-authored-by: Maratyszcza <marat@openai.com> Co-authored-by: Volodymyr Kyrylov <vol@wilab.org.ua>	2025-08-05 08:19:49 -07:00

7 Commits