重构分子属性分析流程并更新目录结构

1. 目录结构调整:
   - 创建scripts目录统一存放分析脚本
   - 保持数据文件在原有目录结构中
   - 生成的CSV文件和PNG图表文件也放在scripts目录下

2. 功能改进:
   - 更新calculate_qed_values.py脚本,添加对参考分子SDF文件的处理
   - 修改analyze_qed_mw_distribution.py脚本,统一使用SDF文件stem名称作为参考分子标识符
   - 改进Vina得分提取逻辑,支持从SDF文件中提取所有构象的得分
   - 完善KDE分布图绘制,确保参考分子在所有图表中显示统一的名称

3. 文档更新:
   - 更新README.md中的目录结构说明
   - 更新命令行和API使用示例
   - 添加详细的使用说明和示例

4. 示例代码:
   - 更新example_api_usage.py以适应新的目录结构和API调用方式
This commit is contained in:
2025-08-05 17:00:39 +08:00
parent d68b536aec
commit f6c182f38e
18 changed files with 85692 additions and 32 deletions

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,6 @@
center_x = -12.7
center_y = -9.1
center_z = -0.3
size_x = 49.1
size_y = 37.6
size_z = 35.2

View File

@@ -0,0 +1,334 @@
# created with PyMOL 2.5.5
@<TRIPOS>MOLECULE
9NY
161 163 1
SMALL
USER_CHARGES
@<TRIPOS>ATOM
1 P1 -23.423 0.058 11.015 P.3 1 9NY9N 0.000
2 P2 -21.242 1.855 7.472 P.3 1 9NY9N 0.000
3 P3 -19.600 -0.011 6.747 P.3 1 9NY9N 0.000
4 C1 -28.864 2.870 10.743 C.2 1 9NY9N 0.000
5 N1 -31.253 1.944 11.759 N.2 1 9NY9N 0.000
6 O1 -25.238 2.149 11.667 O.3 1 9NY9N 0.000
7 S1 -7.252 -2.141 3.517 S.3 1 9NY9N 0.000
8 C2 -30.013 3.545 10.637 C.2 1 9NY9N 0.000
9 N2 -28.911 1.691 11.383 N.2 1 9NY9N 0.000
10 O2 -23.605 1.182 9.863 O.3 1 9NY9N 0.000
11 C3 -31.160 3.083 11.121 C.2 1 9NY9N 0.000
12 N3 -32.216 3.846 10.961 N.pl3 1 9NY9N 0.000
13 O3 -25.860 3.427 8.461 O.3 1 9NY9N 0.000
14 C4 -28.454 4.678 9.686 C.2 1 9NY9N 0.000
15 N4 -29.740 4.668 9.999 N.2 1 9NY9N 0.000
16 O4 -22.481 2.912 7.175 O.3 1 9NY9N 0.000
17 C5 -26.321 3.400 9.849 C.3 1 9NY9N 0.000
18 N5 -27.867 3.571 10.130 N.2 1 9NY9N 0.000
19 O5 -23.831 0.826 12.359 O.3 1 9NY9N 0.000
20 C6 -30.117 1.201 11.903 C.2 1 9NY9N 0.000
21 N6 -14.933 -4.164 4.827 N.pl3 1 9NY9N 0.000
22 O6 -24.603 -0.926 10.799 O.3 1 9NY9N 0.000
23 C7 -25.814 2.074 10.338 C.3 1 9NY9N 0.000
24 N7 -10.491 -4.133 3.270 N.pl3 1 9NY9N 0.000
25 O7 -22.106 -0.581 11.006 O.co2 1 9NY9N 0.000
26 C8 -24.812 1.735 9.301 C.3 1 9NY9N 0.000
27 O8 -20.484 1.176 6.407 O.3 1 9NY9N 0.000
28 C9 -24.590 3.025 8.600 C.3 1 9NY9N 0.000
29 O9 -20.437 2.908 7.887 O.co2 1 9NY9N 0.000
30 C10 -23.957 2.827 7.234 C.3 1 9NY9N 0.000
31 O10 -21.382 0.839 8.488 O.3 1 9NY9N 0.000
32 C11 -18.398 -3.473 5.784 C.3 1 9NY9N 0.000
33 O11 -19.708 -1.357 6.005 O.3 1 9NY9N 0.000
34 C12 0.049 -11.148 -9.157 C.3 1 9NY9N 0.000
35 O12 -16.857 -5.083 6.332 O.3 1 9NY9N 0.000
36 C13 -1.382 -10.609 -8.856 C.3 1 9NY9N 0.000
37 O13 -18.555 0.516 5.701 O.co2 1 9NY9N 0.000
38 C14 0.168 -12.579 -9.636 C.3 1 9NY9N 0.000
39 O14 -19.441 -0.714 8.058 O.3 1 9NY9N 0.000
40 C15 -1.587 -9.931 -7.421 C.3 1 9NY9N 0.000
41 O15 -16.659 -3.840 3.627 O.2 1 9NY9N 0.000
42 C16 1.031 -13.496 -8.823 C.3 1 9NY9N 0.000
43 O16 -6.929 -0.607 1.206 O.2 1 9NY9N 0.000
44 C17 -1.554 -8.327 -7.282 C.3 1 9NY9N 0.000
45 O17 -11.473 -2.206 3.218 O.2 1 9NY9N 0.000
46 C18 1.986 -14.241 -9.702 C.3 1 9NY9N 0.000
47 C19 -1.099 -7.665 -5.944 C.3 1 9NY9N 0.000
48 C20 2.709 -15.391 -9.040 C.3 1 9NY9N 0.000
49 C21 -2.242 -7.507 -4.994 C.3 1 9NY9N 0.000
50 C22 3.177 -16.279 -10.094 C.3 1 9NY9N 0.000
51 C23 -2.078 -6.266 -4.169 C.3 1 9NY9N 0.000
52 C24 -18.455 -2.047 6.028 C.3 1 9NY9N 0.000
53 C25 -16.941 -3.861 5.857 C.3 1 9NY9N 0.000
54 C26 3.832 -17.476 -9.633 C.3 1 9NY9N 0.000
55 C27 -3.244 -6.133 -3.153 C.3 1 9NY9N 0.000
56 C28 4.432 -17.984 -10.877 C.3 1 9NY9N 0.000
57 C29 -3.452 -4.696 -2.551 C.3 1 9NY9N 0.000
58 C30 -19.144 -3.976 6.972 C.3 1 9NY9N 0.000
59 C31 -19.120 -3.952 4.596 C.3 1 9NY9N 0.000
60 C32 5.079 -19.250 -10.557 C.3 1 9NY9N 0.000
61 C33 -4.861 -4.542 -1.890 C.3 1 9NY9N 0.000
62 C34 6.082 -19.523 -11.577 C.3 1 9NY9N 0.000
63 C35 -5.428 -3.121 -1.772 C.3 1 9NY9N 0.000
64 C36 -16.175 -3.962 4.669 C.2 1 9NY9N 0.000
65 C37 6.602 -20.920 -11.418 C.3 1 9NY9N 0.000
66 C38 -5.056 -2.340 -0.460 C.3 1 9NY9N 0.000
67 C39 6.912 -21.512 -12.748 C.3 1 9NY9N 0.000
68 C40 -5.766 -2.711 0.889 C.3 1 9NY9N 0.000
69 C41 -13.968 -4.157 3.756 C.3 1 9NY9N 0.000
70 C42 6.539 -22.935 -12.884 C.3 1 9NY9N 0.000
71 C43 -12.551 -3.951 4.418 C.3 1 9NY9N 0.000
72 C44 -6.629 -1.663 1.695 C.2 1 9NY9N 0.000
73 C45 -11.469 -3.356 3.552 C.2 1 9NY9N 0.000
74 C46 -8.022 -3.771 3.135 C.3 1 9NY9N 0.000
75 C47 -9.401 -3.632 2.426 C.3 1 9NY9N 0.000
76 H01 -24.998 1.268 11.962 H 1 9NY9N 0.000
77 H02 -33.112 3.550 11.320 H 1 9NY9N 0.000
78 H03 -32.132 4.729 10.479 H 1 9NY9N 0.000
79 H04 -27.950 5.477 9.143 H 1 9NY9N 0.000
80 H05 -25.942 4.281 10.368 H 1 9NY9N 0.000
81 H06 -23.757 1.773 12.222 H 1 9NY9N 0.000
82 H07 -30.147 0.239 12.416 H 1 9NY9N 0.000
83 H08 -14.598 -4.341 5.763 H 1 9NY9N 0.000
84 H09 -24.859 -1.310 11.641 H 1 9NY9N 0.000
85 H10 -26.596 1.324 10.454 H 1 9NY9N 0.000
86 H11 -10.470 -5.079 3.624 H 1 9NY9N 0.000
87 H12 -25.152 0.952 8.623 H 1 9NY9N 0.000
88 H13 -23.922 3.711 9.121 H 1 9NY9N 0.000
89 H14 -24.200 1.804 6.948 H 1 9NY9N 0.000
90 H15 -24.361 3.589 6.567 H 1 9NY9N 0.000
91 H16 -21.113 1.196 9.338 H 1 9NY9N 0.000
92 H17 0.560 -11.133 -8.194 H 1 9NY9N 0.000
93 H18 0.485 -10.508 -9.924 H 1 9NY9N 0.000
94 H19 -16.600 -5.683 5.628 H 1 9NY9N 0.000
95 H20 -2.033 -11.483 -8.864 H 1 9NY9N 0.000
96 H21 -1.626 -9.866 -9.615 H 1 9NY9N 0.000
97 H22 0.659 -12.514 -10.607 H 1 9NY9N 0.000
98 H23 -0.835 -13.004 -9.659 H 1 9NY9N 0.000
99 H24 -19.068 -1.586 7.911 H 1 9NY9N 0.000
100 H25 -0.707 -10.251 -6.863 H 1 9NY9N 0.000
101 H26 -2.558 -10.261 -7.051 H 1 9NY9N 0.000
102 H27 0.392 -14.216 -8.311 H 1 9NY9N 0.000
103 H28 1.595 -12.909 -8.098 H 1 9NY9N 0.000
104 H29 -2.606 -8.050 -7.351 H 1 9NY9N 0.000
105 H30 -0.885 -7.958 -8.059 H 1 9NY9N 0.000
106 H31 2.756 -13.524 -9.986 H 1 9NY9N 0.000
107 H32 1.426 -14.637 -10.549 H 1 9NY9N 0.000
108 H33 -0.703 -6.675 -6.170 H 1 9NY9N 0.000
109 H34 -0.336 -8.290 -5.480 H 1 9NY9N 0.000
110 H35 2.043 -15.921 -8.359 H 1 9NY9N 0.000
111 H36 3.556 -15.017 -8.464 H 1 9NY9N 0.000
112 H37 -2.265 -8.370 -4.328 H 1 9NY9N 0.000
113 H38 -3.172 -7.441 -5.559 H 1 9NY9N 0.000
114 H39 3.925 -15.719 -10.655 H 1 9NY9N 0.000
115 H40 2.317 -16.573 -10.696 H 1 9NY9N 0.000
116 H41 -2.085 -5.399 -4.830 H 1 9NY9N 0.000
117 H42 -1.133 -6.316 -3.628 H 1 9NY9N 0.000
118 H43 -17.930 -1.630 5.169 H 1 9NY9N 0.000
119 H44 -18.024 -1.881 7.015 H 1 9NY9N 0.000
120 H45 -16.540 -3.033 6.441 H 1 9NY9N 0.000
121 H46 3.165 -18.184 -9.141 H 1 9NY9N 0.000
122 H47 4.594 -17.251 -8.886 H 1 9NY9N 0.000
123 H48 -2.985 -6.776 -2.312 H 1 9NY9N 0.000
124 H49 -4.165 -6.424 -3.657 H 1 9NY9N 0.000
125 H50 5.153 -17.272 -11.278 H 1 9NY9N 0.000
126 H51 3.659 -18.139 -11.630 H 1 9NY9N 0.000
127 H52 -3.379 -3.974 -3.364 H 1 9NY9N 0.000
128 H53 -2.686 -4.514 -1.797 H 1 9NY9N 0.000
129 H54 -20.138 -4.301 6.665 H 1 9NY9N 0.000
130 H55 -18.607 -4.817 7.410 H 1 9NY9N 0.000
131 H56 -19.234 -3.178 7.710 H 1 9NY9N 0.000
132 H57 -20.022 -3.357 4.455 H 1 9NY9N 0.000
133 H58 -18.481 -3.855 3.718 H 1 9NY9N 0.000
134 H59 -19.393 -4.998 4.733 H 1 9NY9N 0.000
135 H60 4.341 -20.052 -10.532 H 1 9NY9N 0.000
136 H61 5.556 -19.184 -9.579 H 1 9NY9N 0.000
137 H62 -4.727 -4.877 -0.862 H 1 9NY9N 0.000
138 H63 -5.566 -5.129 -2.478 H 1 9NY9N 0.000
139 H64 6.906 -18.818 -11.468 H 1 9NY9N 0.000
140 H65 5.634 -19.413 -12.565 H 1 9NY9N 0.000
141 H66 -6.512 -3.237 -1.748 H 1 9NY9N 0.000
142 H67 -5.062 -2.543 -2.621 H 1 9NY9N 0.000
143 H68 5.840 -21.527 -10.929 H 1 9NY9N 0.000
144 H69 7.508 -20.900 -10.813 H 1 9NY9N 0.000
145 H70 -5.402 -1.326 -0.660 H 1 9NY9N 0.000
146 H71 -3.986 -2.475 -0.303 H 1 9NY9N 0.000
147 H72 7.994 -21.462 -12.866 H 1 9NY9N 0.000
148 H73 6.376 -20.944 -13.508 H 1 9NY9N 0.000
149 H74 -4.919 -2.872 1.556 H 1 9NY9N 0.000
150 H75 -6.429 -3.547 0.666 H 1 9NY9N 0.000
151 H76 -14.184 -3.363 3.041 H 1 9NY9N 0.000
152 H77 -13.997 -5.104 3.218 H 1 9NY9N 0.000
153 H78 5.595 -23.013 -13.424 H 1 9NY9N 0.000
154 H79 7.316 -23.464 -13.435 H 1 9NY9N 0.000
155 H80 6.429 -23.378 -11.894 H 1 9NY9N 0.000
156 H81 -12.202 -4.956 4.655 H 1 9NY9N 0.000
157 H82 -12.682 -3.295 5.278 H 1 9NY9N 0.000
158 H83 -8.189 -4.286 4.081 H 1 9NY9N 0.000
159 H84 -7.350 -4.331 2.485 H 1 9NY9N 0.000
160 H85 -9.579 -2.575 2.228 H 1 9NY9N 0.000
161 H86 -9.381 -4.201 1.496 H 1 9NY9N 0.000
@<TRIPOS>BOND
1 1 10 1
2 1 19 1
3 1 22 1
4 1 25 2
5 2 16 1
6 2 27 1
7 2 29 2
8 2 31 1
9 3 27 1
10 3 33 1
11 3 37 2
12 3 39 1
13 4 8 ar
14 4 9 ar
15 4 18 ar
16 5 11 ar
17 5 20 ar
18 6 23 1
19 6 76 1
20 7 72 1
21 7 74 1
22 8 11 ar
23 8 15 ar
24 9 20 ar
25 10 26 1
26 11 12 1
27 12 77 1
28 12 78 1
29 13 17 1
30 13 28 1
31 14 15 ar
32 14 18 ar
33 14 79 1
34 16 30 1
35 17 18 1
36 17 23 1
37 17 80 1
38 19 81 1
39 20 82 1
40 21 64 1
41 21 69 1
42 21 83 1
43 22 84 1
44 23 26 1
45 23 85 1
46 24 73 1
47 24 75 1
48 24 86 1
49 26 28 1
50 26 87 1
51 28 30 1
52 28 88 1
53 30 89 1
54 30 90 1
55 31 91 1
56 32 52 1
57 32 53 1
58 32 58 1
59 32 59 1
60 33 52 1
61 34 36 1
62 34 38 1
63 34 92 1
64 34 93 1
65 35 53 1
66 35 94 1
67 36 40 1
68 36 95 1
69 36 96 1
70 38 42 1
71 38 97 1
72 38 98 1
73 39 99 1
74 40 44 1
75 40 100 1
76 40 101 1
77 41 64 2
78 42 46 1
79 42 102 1
80 42 103 1
81 43 72 2
82 44 47 1
83 44 104 1
84 44 105 1
85 45 73 2
86 46 48 1
87 46 106 1
88 46 107 1
89 47 49 1
90 47 108 1
91 47 109 1
92 48 50 1
93 48 110 1
94 48 111 1
95 49 51 1
96 49 112 1
97 49 113 1
98 50 54 1
99 50 114 1
100 50 115 1
101 51 55 1
102 51 116 1
103 51 117 1
104 52 118 1
105 52 119 1
106 53 64 1
107 53 120 1
108 54 56 1
109 54 121 1
110 54 122 1
111 55 57 1
112 55 123 1
113 55 124 1
114 56 60 1
115 56 125 1
116 56 126 1
117 57 61 1
118 57 127 1
119 57 128 1
120 58 129 1
121 58 130 1
122 58 131 1
123 59 132 1
124 59 133 1
125 59 134 1
126 60 62 1
127 60 135 1
128 60 136 1
129 61 63 1
130 61 137 1
131 61 138 1
132 62 65 1
133 62 139 1
134 62 140 1
135 63 66 1
136 63 141 1
137 63 142 1
138 65 67 1
139 65 143 1
140 65 144 1
141 66 68 1
142 66 145 1
143 66 146 1
144 67 70 1
145 67 147 1
146 67 148 1
147 68 72 1
148 68 149 1
149 68 150 1
150 69 71 1
151 69 151 1
152 69 152 1
153 70 153 1
154 70 154 1
155 70 155 1
156 71 73 1
157 71 156 1
158 71 157 1
159 74 75 1
160 74 158 1
161 74 159 1
162 75 160 1
163 75 161 1
@<TRIPOS>SUBSTRUCTURE
1 9NY9N 1 GROUP 1 **** 9NY

View File

@@ -0,0 +1,6 @@
center_x = 7.402
center_y = -4.783
center_z = -11.818
size_x = 30.000
size_y = 30.000
size_z = 30.000

File diff suppressed because it is too large Load Diff

View File

@@ -0,0 +1,64 @@
# created with PyMOL 2.5.5
@<TRIPOS>MOLECULE
lig_0GA
27 27 1
SMALL
USER_CHARGES
@<TRIPOS>ATOM
1 CAA 3.671 -4.365 -12.762 C.3 1 0GA604 0.000
2 CAG 4.543 -4.916 -13.859 C.3 1 0GA604 0.000
3 CAH 7.819 -6.866 -10.651 C.2 1 0GA604 0.000
4 CAI 7.083 -6.258 -11.655 C.2 1 0GA604 0.000
5 CAJ 8.615 -6.099 -9.808 C.2 1 0GA604 0.000
6 CAL 6.714 -5.330 -14.847 C.2 1 0GA604 0.000
7 CAM 9.522 -3.884 -9.056 C.2 1 0GA604 0.000
8 CAN 5.861 -4.806 -13.815 C.3 1 0GA604 0.000
9 CAO 7.917 -4.125 -10.939 C.2 1 0GA604 0.000
10 CAP 7.148 -4.871 -11.777 C.2 1 0GA604 0.000
11 CAQ 8.677 -4.728 -9.955 C.2 1 0GA604 0.000
12 OAB 7.818 -4.802 -15.048 O.co2 1 0GA604 0.000
13 OAC 9.439 -2.654 -9.176 O.co2 1 0GA604 0.000
14 OAD 6.325 -6.310 -15.475 O.co2 1 0GA604 0.000
15 OAE 10.263 -4.375 -8.185 O.co2 1 0GA604 0.000
16 OAF 7.929 -2.774 -11.171 O.3 1 0GA604 0.000
17 OAK 6.484 -4.156 -12.720 O.3 1 0GA604 0.000
18 H01 2.698 -4.855 -12.792 H 1 0GA604 0.000
19 H02 4.141 -4.550 -11.796 H 1 0GA604 0.000
20 H03 3.541 -3.292 -12.904 H 1 0GA604 0.000
21 H04 4.283 -4.281 -14.706 H 1 0GA604 0.000
22 H05 4.325 -5.982 -13.920 H 1 0GA604 0.000
23 H06 7.774 -7.947 -10.522 H 1 0GA604 0.000
24 H07 6.468 -6.850 -12.333 H 1 0GA604 0.000
25 H08 9.196 -6.585 -9.024 H 1 0GA604 0.000
26 H09 4.949 -5.078 -14.346 H 1 0GA604 0.000
27 H10 7.287 -2.345 -10.601 H 1 0GA604 0.000
@<TRIPOS>BOND
1 1 2 1
2 1 18 1
3 1 19 1
4 1 20 1
5 2 8 1
6 2 21 1
7 2 22 1
8 3 4 ar
9 3 5 ar
10 3 23 1
11 4 10 ar
12 4 24 1
13 5 11 ar
14 5 25 1
15 6 8 1
16 6 12 ar
17 6 14 ar
18 7 11 1
19 7 13 ar
20 7 15 ar
21 8 17 1
22 8 26 1
23 9 10 ar
24 9 11 ar
25 9 16 1
26 10 17 1
27 16 27 1
@<TRIPOS>SUBSTRUCTURE
1 0GA604 1 GROUP 1 A 0GA