More tolerant docstring parser #31

PicoCentauri · 2021-05-20T10:05:42Z

Our current docstring parser located at

Line 163 in 78fa3f2

def parse_docs(klass):

returns a dictionary of the the docstring. It works but it is not as flexible and tolerant as the sphinx/napoleon implementation. Especially we have problems with the separator between a parameter name and its type; usually denoted by name : type. A different notation can not be parsed since we use a hardcoded split

mdacli/src/mdacli/utils.py

Lines 230 to 232 in 78fa3f2

    
           for line in doc_lines[par_i: end_param_line][::-1]: 
        
               if ' : ' in line: 
        
                   par_name, others_ = line.split(' : ')

Improvements with using a regex did also not succeed. If possible we should incorporate the sphinx parser or at least get some ideas from their implementation.

The text was updated successfully, but these errors were encountered:

joaomcteixeira · 2021-05-20T22:38:28Z

I had committed a re.split in the PR, then we rejected it. Won't that help?
Do you wan to go beyond #23 ?

joaomcteixeira · 2021-05-20T23:10:52Z

Example by @PicoCentauri

"""
    Attributes
    ----------
    dim_fac : int
        Dimensionality :math:`d` of the MSD.
    results.timeseries : :class:`numpy.ndarray`
        The averaged MSD over all the particles with respect to lag-time.
    results.msds_by_particle : :class:`numpy.ndarray`
        The MSD of each individual particle with respect to lag-time.
    ag : :class:`AtomGroup`
        The :class:`AtomGroup` resulting from your selection
    n_frames : int
        Number of frames included in the analysis.
    n_particles : int
        Number of particles MSD was calculated over.


    .. versionadded:: 2.0.0
"""

joaomcteixeira · 2021-05-20T23:13:37Z

"""
    Parameters
    ----------
    universe : Universe
        Universe object
    select : str
        Selection string to evaluate its angular distribution ['byres name OH2']
    bins : int (optional)
        Number of bins to create the histogram by means of :func:`numpy.histogram`
    axis : {'x', 'y', 'z'} (optional)
        Axis to create angle with the vector (HH, OH or dipole) and calculate
        cosine theta ['z'].


    .. versionadded:: 0.11.0

    .. versionchanged:: 1.0.0
       Changed `selection` keyword to `select`
"""

joaomcteixeira · 2021-05-20T23:16:29Z

try again numpydocs. where docstrings updated in mda since #2 ?

PicoCentauri · 2021-05-21T12:16:13Z

I just checked the numpydocs parser and they are still not able to parse docstrings without surrounding spaces...

joaomcteixeira · 2021-05-24T14:35:07Z

Yes,

I tried the other day and I saw it was also a bit of a mess. I think we (ok me 😛 ) should write this more general parser, that is also compatible with the synphx parsing and than create tests on the side of MDAnalysis to enforce new Analysis classes follow these guidelines.

PicoCentauri added enhancement New feature or request help wanted Extra attention is needed labels May 20, 2021

PicoCentauri assigned joaomcteixeira May 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More tolerant docstring parser #31

More tolerant docstring parser #31

PicoCentauri commented May 20, 2021

joaomcteixeira commented May 20, 2021

joaomcteixeira commented May 20, 2021

joaomcteixeira commented May 20, 2021 •

edited

joaomcteixeira commented May 20, 2021

PicoCentauri commented May 21, 2021

joaomcteixeira commented May 24, 2021

More tolerant docstring parser #31

More tolerant docstring parser #31

Comments

PicoCentauri commented May 20, 2021

joaomcteixeira commented May 20, 2021

joaomcteixeira commented May 20, 2021

joaomcteixeira commented May 20, 2021 • edited

joaomcteixeira commented May 20, 2021

PicoCentauri commented May 21, 2021

joaomcteixeira commented May 24, 2021

joaomcteixeira commented May 20, 2021 •

edited