Files
macrolactone-toolkit/scripts
lingyuzeng 3e07402f4e feat(numbering): publish canonical numbering API
Add a public numbering module and route fragmenting, validation,
and scaffold preparation through the canonical numbering entry.

Rewrite the repository entry docs around the fixed numbering
contract, add MkDocs landing pages, and document the mirror
mapping used for medicinal-chemistry comparisons.

Also refresh the validation analysis reports to explain the
canonical-versus-mirrored numbering relationship.
2026-03-20 15:14:31 +08:00
..
2025-11-14 20:34:58 +08:00

scripts

这些脚本都基于 macro_lactone_toolkit.* 的正式包接口,不再依赖旧的 src.* 模块。

  • batch_process.py: 读取分子 CSV输出 flat fragments.csverrors.csv 和处理摘要 JSON
  • batch_process_ring16.py: 固定 --ring-size 16 的批处理入口
  • batch_process_multi_rings.py: 自动识别 12-20 元环的批处理入口
  • analyze_fragments.py: 读取 flat fragment CSV生成位置统计、性质汇总和频率图
  • generate_sdf_and_statistics.py: 读取 flat fragment CSV生成 cleavage 统计 JSON 和 3D SDF
  • tylosin_splicer.py: 使用 macro_lactone_toolkit.splicing.* 做简单拼接

推荐工作流:

python scripts/batch_process.py --input molecules.csv --output fragments.csv --errors-output errors.csv
python scripts/analyze_fragments.py --input fragments.csv --output-dir analysis
python scripts/generate_sdf_and_statistics.py --input fragments.csv --output-dir sdf_output