Guide to extracting data in bulk from parsed objects

Tract objects have methods for extracting parsed data from themselves individually.

But PLSSDesc and TractList objects contain methods to extract data from multiple Tract objects all at once--or for grouping / filtering / sorting Tract objects based on their attributes. This can be useful for compiling a spreadsheet, or for finding tracts that match some specific condition.

For a guide to which data fields we can extract via these methods (into a spreadsheet, etc.) -- see this table of Tract attributes.

Extracting data from individual `Tract` objects

To extract data from individual Tract objects, use these methods.

To a dict with `.to_dict()`

Pass a list of the names of attributes we want to collect in a dict:

tract = pytrs.Tract('Lots 1 - 3, NE/4', trs='154n97w14', parse_qq=True)
tract_data = tract.to_dict(['trs', 'desc', 'lots', 'qqs'])

The dict stored to variable tract_data contains these values:

{
    'trs': '154n97w14',
    'desc': 'Lots 1 - 3, NE/4',
    'lots': ['L1', 'L2', 'L3'],
    'qqs': ['NENE', 'NWNE', 'SENE', 'SWNE']
}

To a list with `.to_list()`

Pass a list of the names of attributes we want to collect in a list:

tract = pytrs.Tract('Lots 1 - 3, NE/4', trs='154n97w14', parse_qq=True)
tract_data = tract.to_list(['trs', 'desc', 'lots', 'qqs'])

The list stored to variable tract_data contains these values:

[
    '154n97w14',
    'Lots 1 - 3, NE/4',
    ['L1', 'L2', 'L3'],
    ['NENE', 'NWNE', 'SENE', 'SWNE']
]

Extracting data from `Tract` objects in bulk

PLSSDesc objects and TractList objects both have equivalent methods for extracting Tract data (and sorting / filtering / grouping).

When a method is called by a PLSSDesc object, it will operate on the Tract objects stored in its own .tracts attribute (i.e. the Tract objects generated when it was parsed).

On the other hand, a TractList object might contain Tract objects from any number of sources. Examples below show a TractList that holds the Tract objects from a single source, but we can add any number of them from any number of objects. See the guide on TractList objects for how to collect Tract objects from multiple sources.

Compile `Tract` attributes to dicts with `.tracts_to_dict()` or `.iter_to_dict()`

Use .tracts_to_dict() to get a list of dicts (one dict per Tract). Pass a list of the names of attributes we want to collect in each dict.

raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
all_tract_data = parsed_plssdesc.tracts_to_dict(['trs', 'desc', 'lots', 'qqs'])

# Equivalently with a TractList:
some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
all_tract_data = some_tractlist.tracts_to_dict(['trs', 'desc', 'lots', 'qqs'])

A list of dicts has been stored to variable all_tract_data (one dict for each Tract):

[
    {
        'trs': '154n97w14',
        'desc': 'NE/4',
        'lots': [],
        'qqs': ['NENE', 'NWNE', 'SENE', 'SWNE']
    },

    {
        'trs': '154n97w15',
        'desc': 'Lots 1 - 3, S/2SW/4',
        'lots': ['L1', 'L2', 'L3'],
        'qqs': ['SESW', 'SWSW']
    }
]

Get a `generator` instead with `.iter_to_dict()`

raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
dict_generator = parsed_plssdesc.iter_to_dict(['trs', 'desc', 'lots', 'qqs'])

# Equivalently with a TractList:
some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
dict_generator = some_tractlist.iter_to_dict(['trs', 'desc', 'lots', 'qqs'])

type(dict_generator)        # -> <class 'generator'>
dict_for_tract1 = next(dict_generator)

In this example, dict_for_tract1 looks like this:

{
    'trs': '154n97w14',
    'desc': 'NE/4',
    'lots': [],
    'qqs': ['NENE', 'NWNE', 'SENE', 'SWNE']
}

Compile `Tract` attributes to lists with `.tracts_to_list()` or `.iter_to_list()`

Use .tracts_to_list() to get a nested list of lists (one list per Tract). Pass a list of the names of attributes we want to collect.

raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
all_tract_data = parsed_plssdesc.tracts_to_list(['trs', 'desc', 'lots', 'qqs'])

# Equivalently with a TractList:
some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
all_tract_data = some_tractlist.tracts_to_list(['trs', 'desc', 'lots', 'qqs'])

A list of lists has been stored to variable all_tract_data (one list for each Tract):

[
    ['154n97w14', 'NE/4', [], ['NENE', 'NWNE', 'SENE', 'SWNE']],
    ['154n97w15', 'Lots 1 - 3, S/2SW/4', ['L1', 'L2', 'L3'], ['SESW', 'SWSW']]
]

Get a `generator` instead with `.iter_to_list()`

raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
list_generator = parsed_plssdesc.iter_to_list(['trs', 'desc', 'lots', 'qqs'])

# Equivalently with a TractList:
some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
list_generator = some_tractlist.iter_to_list(['trs', 'desc', 'lots', 'qqs'])

type(dict_generator)        # -> <class 'generator'>
list_for_tract1 = next(list_generator)

In this example, list_for_tract1 looks like this:

['154n97w14', 'NE/4', [], ['NENE', 'NWNE', 'SENE', 'SWNE']]

Write `Tract` attributes to a .csv file with `.tracts_to_csv()`

(Note: A more robust csv writer class is included as pytrs.tractwriter.TractWriter. But this method is simpler, if we just need to dump the data to a csv.)

Use this to write data to a .csv file (one row per Tract).

raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
all_tract_data = parsed_plssdesc.tracts_to_csv(
    attributes=['trs', 'desc', 'lots', 'qqs'],
    fp=<some path>,
    mode='w',
    nice_headers=True)

# Equivalently with a TractList:
some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
all_tract_data = some_tractlist.tracts_to_csv(
    attributes=['trs', 'desc', 'lots', 'qqs'],
    fp=<some path>,
    mode='w',
    nice_headers=True)

Parameter	Explanation	Footnote
`attributes=<list>`	a list of names of the attributes to include
`fp=<path>`	The filepath of the .csv file to write to.
`mode=<'w' or 'a'>`	The mode to open the file in (either `'w'` or `'a'`)	1
`nice_headers=<bool, list, or dict>`	use custom headers (see footnote)	2

mode serves the same purpose as it does in the built-in open() function. Pass either 'w' (a new file, and overwriting any file already at that path) or 'a' to add this data to an existing .csv file.
nice_headers defaults to False (i.e. just use the attribute names themselves as headers). Alternatively, may pass any of the following:

a dict keyed by attribute name, whose values are the headers to use. (Any missing keys will result in using the attribute name itself.)
a list of headers to use. (Should be equal in length to the list passed as attributes, but will not raise an error if that's not the case. The resulting column headers will just be fewer than the actual number of columns.)
pass as True to use the values in the Tract.ATTRIBUTES dict for headers. (Warning: Any value passed that is not a list or dict and that evaluates as True will cause this behavior.)

Print `Tract` attributes to console with `.print_data()`

If you just need to quickly check the data, you can print it to console.

>>> raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

>>> parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
>>> parsed_plssdesc.print_data(['trs', 'desc', 'lots', 'qqs'])

The above example prints this to console:

Tract 1 / 2
trs  : 154n97w14
desc : NE/4
lots : 
qqs  : NENE, NWNE, SENE, SWNE

Tract 2 / 2
trs  : 154n97w15
desc : Lots 1 - 3, S/2SW/4
lots : L1, L2, L3
qqs  : SESW, SWSW

Equivalently with a TractList:

>>> raw_description = """T154N-R97W
Sec 14: NE/4
Sec 15: Lots 1 - 3, S/2SW/4"""

>>> parsed_plssdesc = pytrs.PLSSDesc(raw_description, parse_qq=True)
>>> some_tractlist = pytrs.TractList.from_multiple([parsed_plssdesc])
>>> some_tractlist.print_data(['trs', 'desc', 'lots', 'qqs'])

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extracting_data.md

extracting_data.md

Guide to extracting data in bulk from parsed objects

Extracting data from individual `Tract` objects

To a dict with `.to_dict()`

To a list with `.to_list()`

Extracting data from `Tract` objects in bulk

Compile `Tract` attributes to dicts with `.tracts_to_dict()` or `.iter_to_dict()`

Get a `generator` instead with `.iter_to_dict()`

Compile `Tract` attributes to lists with `.tracts_to_list()` or `.iter_to_list()`

Get a `generator` instead with `.iter_to_list()`

Write `Tract` attributes to a .csv file with `.tracts_to_csv()`

Print `Tract` attributes to console with `.print_data()`

Files

extracting_data.md

Latest commit

History

extracting_data.md

File metadata and controls

Guide to extracting data in bulk from parsed objects

Extracting data from individual Tract objects

To a dict with .to_dict()

To a list with .to_list()

Extracting data from Tract objects in bulk

Compile Tract attributes to dicts with .tracts_to_dict() or .iter_to_dict()

Get a generator instead with .iter_to_dict()

Compile Tract attributes to lists with .tracts_to_list() or .iter_to_list()

Get a generator instead with .iter_to_list()

Write Tract attributes to a .csv file with .tracts_to_csv()

Print Tract attributes to console with .print_data()

Extracting data from individual `Tract` objects

To a dict with `.to_dict()`

To a list with `.to_list()`

Extracting data from `Tract` objects in bulk

Compile `Tract` attributes to dicts with `.tracts_to_dict()` or `.iter_to_dict()`

Get a `generator` instead with `.iter_to_dict()`

Compile `Tract` attributes to lists with `.tracts_to_list()` or `.iter_to_list()`

Get a `generator` instead with `.iter_to_list()`

Write `Tract` attributes to a .csv file with `.tracts_to_csv()`

Print `Tract` attributes to console with `.print_data()`