feat(toolkit): add classification and migration

Implement the standard/non-standard/not-macrolactone classification layer
and integrate it into analyzer, fragmenter, and CLI outputs.

Port the remaining legacy package capabilities into new visualization and
workflow modules, restore batch/statistics/SDF scripts on top of the flat
CSV workflow, and update active docs to the new package API.
This commit is contained in:
2026-03-18 23:56:41 +08:00
parent 9ccbcfcd04
commit c0ead42384
24 changed files with 1497 additions and 313 deletions

View File

@@ -1,11 +1,18 @@
# scripts
这些脚本现在都是基于 `macro_lactone_toolkit.*`薄封装或迁移提示
这些脚本基于 `macro_lactone_toolkit.*`正式包接口,不再依赖旧的 `src.*` 模块
- `batch_process.py`: 等价于 `macro-lactone-toolkit fragment`
- `batch_process_ring16.py`: 等价于 `macro-lactone-toolkit fragment --ring-size 16`
- `batch_process_multi_rings.py`: 自动识别模式的批处理封装
- `analyze_fragments.py`: 等价于 `macro-lactone-toolkit analyze`
- `batch_process.py`: 读取分子 CSV输出 flat `fragments.csv``errors.csv` 和处理摘要 JSON
- `batch_process_ring16.py`: 固定 `--ring-size 16` 的批处理入口
- `batch_process_multi_rings.py`: 自动识别 12-20 元环的批处理入口
- `analyze_fragments.py`: 读取 flat fragment CSV生成位置统计、性质汇总和频率图
- `generate_sdf_and_statistics.py`: 读取 flat fragment CSV生成 cleavage 统计 JSON 和 3D SDF
- `tylosin_splicer.py`: 使用 `macro_lactone_toolkit.splicing.*` 做简单拼接
核心实现与正式接口都在 `src/macro_lactone_toolkit/` 中。
推荐工作流:
```bash
python scripts/batch_process.py --input molecules.csv --output fragments.csv --errors-output errors.csv
python scripts/analyze_fragments.py --input fragments.csv --output-dir analysis
python scripts/generate_sdf_and_statistics.py --input fragments.csv --output-dir sdf_output
```