A new "ndirect" mode for preprocessing #4

Cloudac7 · 2019-10-11T10:50:53Z

For some large, non-orthogonal box, it could be rather slow using direct mode to preprocess. So in this pr, a new mode ndirect, using numpy ndarray to produce neighbor list, is developed and has been tested to give the same result as the original direct mode.

txie-93

Thank you for your contribution! The code looks great. I have several very minor issues, mainly about formatting. The code should be ready to merge once you fix those. In addition, did you test if the new algorithm gives the exact same results as the existing two for the given trajectories?

txie-93 · 2019-10-17T03:14:30Z

README.md

@@ -96,7 +96,7 @@ Then, you can use the `preprocess.py` to preprocess the `traj.npz`. It will crea
 python preprocess.py traj.npz graph.npz
 ```

-Note that the graph construction is slow especially for large MD trajectories. There two different graph construction algorithms implemented. The default `--backend kdtree` has a linear scaling but only works for orthogonal simulation box. For non-orthogonal simulation, use flag `--backend direct` which has a quadratic scaling. You can also take advantage of the multiprocessing with flag `--n-workers`. For other flags, checkout the help information with `python preprocess.py -h`.
+Note that the graph construction is slow especially for large MD trajectories. There two different graph construction algorithms implemented. The default `--backend kdtree` has a linear scaling but only works for orthogonal simulation box. For non-orthogonal simulation, use flag `--backend direct` or `--backend ndirect` which has a quadratic scaling (for the two choices, the latter is specially efficient for large cells, while the former could be quick for small ones). You can also take advantage of the multiprocessing with flag `--n-workers`. For other flags, checkout the help information with `python preprocess.py -h`.


Small typos:
Two different graph convolution algorithm -> three

txie-93 · 2019-10-17T03:17:06Z

gdynet/parsers.py

-                         'lattices but has quadratic scaling. '
+                         'lattices but has quadratic scaling. "ndirect" is '
+                         'an enhanced method for "direct" which could '
+                         'accelarate the process ofdealing with large lattices.'


ofdealing -> of dealing

txie-93 · 2019-10-17T03:17:21Z

gdynet/parsers.py

-prep_parser.add_argument('--backend', choices=['kdtree', 'direct'],
-                         default='kdtree', help='either "kdtree" or "direct", '
+prep_parser.add_argument('--backend', choices=['kdtree', 'direct', 'ndirect'],
+                         default='kdtree', help='"kdtree", "direct" or "ndirect" available, '


Remove “available” after “ndirect”

txie-93 · 2019-10-17T03:19:51Z

gdynet/preprocess.py

@@ -169,6 +169,33 @@ def construct_graph(self, traj_coords, lattices, atom_types, target_index):
                    'target_index': target_index,
                    'nbr_lists': nbr_lists,
                    'nbr_dists': nbr_dists}
+        elif self.backend == 'ndirect':
+            stcs = [Structure(lattice=lattices[i],


Don’t use such complex list comprehensions. Use a for loop for code readability.

txie-93 · 2019-10-17T03:20:24Z

gdynet/preprocess.py

+            a, b, c = [np.ceil(2*self.radius/d).astype('int')
+                       for d in stcs[0].lattice.abc]
+            if [a, b, c] != [1, 1, 1]:
+                _ = [stc.make_supercell(


Use a for loop here. As well as several places below.

txie-93 · 2019-10-17T03:25:24Z

gdynet/preprocess.py

+                                 :, 1:1+self.n_nbrs] for stc in tqdm(
+                stcs, desc='Generating neighbor index...', disable=not self.verbose)], dtype='int32')
+            nbr_dists = np.array([np.sort(stc.distance_matrix)[
+                                 :, 1:1+self.n_nbrs] for stc in tqdm(


Can you reformat your code according to PEP8? There should be whitespaces between 1+ for example. You can do it with automated tools.

Cloudac7 · 2019-10-17T14:56:40Z

Thank you for your contribution! The code looks great. I have several very minor issues, mainly about formatting. The code should be ready to merge once you fix those. In addition, did you test if the new algorithm gives the exact same results as the existing two for the given trajectories?

Thanks a lot for pointing out and I will fix soon.

Cloudac7 added 5 commits August 20, 2019 19:53

Improve of function construct_graph

978195c

fix map

5ebf5d3

fix some bugs

62b933c

fix

4595568

add ndirect flag for preprocess

02e4ecd

txie-93 reviewed Oct 17, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A new "ndirect" mode for preprocessing #4

A new "ndirect" mode for preprocessing #4

Cloudac7 commented Oct 11, 2019

txie-93 left a comment

txie-93 Oct 17, 2019

txie-93 Oct 17, 2019

txie-93 Oct 17, 2019

txie-93 Oct 17, 2019

txie-93 Oct 17, 2019

txie-93 Oct 17, 2019 •

edited

Cloudac7 commented Oct 17, 2019

A new "ndirect" mode for preprocessing #4

Are you sure you want to change the base?

A new "ndirect" mode for preprocessing #4

Conversation

Cloudac7 commented Oct 11, 2019

txie-93 left a comment

Choose a reason for hiding this comment

txie-93 Oct 17, 2019

Choose a reason for hiding this comment

txie-93 Oct 17, 2019

Choose a reason for hiding this comment

txie-93 Oct 17, 2019

Choose a reason for hiding this comment

txie-93 Oct 17, 2019

Choose a reason for hiding this comment

txie-93 Oct 17, 2019

Choose a reason for hiding this comment

txie-93 Oct 17, 2019 • edited

Choose a reason for hiding this comment

Cloudac7 commented Oct 17, 2019

txie-93 Oct 17, 2019 •

edited