Goals

Translators

  • Swap out remaining usages of VirtualData + HDFWriter for hdf_utils (especially outside io.translators)
    • Test all existing translators to make sure they still work.
  • Gateway translators for SPM:
    • Gwyddion:
      • Flesh out the new skeleton translator; use the existing Jupyter notebook for reference (see the sketch after this list)
      • For the native .gwy format, use the gwyfile package (already added to requirements)
      • For the simpler .gsf format, use gsf_read()
    • WSxM
  • Translators for popular AFM / SPM formats
    • Bruker / Veeco / Digital Instruments - done, but look into Iaroslav Gaponenko's code to add any missing functionality, normalization of data, etc. Also check against this project.
    • Nanonis - done, but look into Iaroslav Gaponenko's reader to make sure nothing is missed or done incorrectly.
      • Address .SXM, .3DS, and .DAT translation issues
    • Asylum ARDF - use Liam's data + files from Asylum
    • Park Systems - DaeHee Seol and Prof. Yunseok Kim may help here
    • JPK - no data available. A GitHub project from Ross Carter is available for translation.
    • Niche data formats:
      • NT-MDT - data available; translator pending.
      • PiFM - J. Kong from the Ginger Group will write a translator upon her visit to CNMS in the summer of 2018.
      • Anasys - import anasyspythontools - comes with test data
        • This package does NOT have a PyPI installer
        • This package does not appear to be finished.
  • Write plugins for ImageJ, and possibly Gwyddion, that export to pycroscopy HDF5. HDF5 plugins are already available for ImageJ.
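
A minimal sketch for the Gwyddion item above, assuming the gwyfile package already listed in requirements; the file name is hypothetical. For the simpler .gsf format, the reference gsf_read() implementation from the Gwyddion documentation can be used instead::

    import gwyfile

    # load the top-level GwyContainer from a native Gwyddion file
    obj = gwyfile.load('scan.gwy')  # hypothetical file name

    # map channel titles to GwyDataField objects
    channels = gwyfile.util.get_datafields(obj)
    for title, field in channels.items():
        # field.data is a 2D numpy array; xreal / yreal are physical extents
        print(title, field.data.shape, field.xreal, field.yreal)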

Ease of use

  • Extend USIDataset for scientific data types, such as a PFMDataset, and add domain-specific capabilities (see the sketch after this list).
    • Example - a BEDataset could have functions like guess() and fit() that automatically instantiate a Fitter object and run the guess and fit steps.
    • Operations such as dask_array = bedataset.in_field, which would return a view of the HDF5 dataset.
    • Perhaps this BEDataset could have properties that link it to children like a BESHODataset, which in turn could have properties such as loop_parameters that happen to be a BELoopDataset object.
    • Each of these objects would be able to tweak the visualize() function in the manner that makes the most sense for its data type.
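
A minimal sketch of this idea, assuming pyUSID's USIDataset; BEDataset and its method bodies are hypothetical and only illustrate the intended API::

    from pyUSID import USIDataset

    class BEDataset(USIDataset):
        """Band-excitation data with domain-specific conveniences."""

        def guess(self, **kwargs):
            # would instantiate the appropriate Fitter, run its guess step,
            # and return a child dataset such as a BESHODataset
            raise NotImplementedError

        def fit(self, **kwargs):
            # would run the fit step of the same Fitter
            raise NotImplementedError

        def visualize(self, **kwargs):
            # overridden with the plots that make the most sense for BE data
            raise NotImplementedError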

Scalability

  • Make Fitter extend pyUSID.Process in order to enable scalable fitting (see the sketch after this list).
    • This will automatically allow BESHOFitter and BELoopFitter to scale on HPCs.
  • Chris - Image Processing must become a subclass of Process and implement resuming of computation and checking for old results (both already handled quite well in Process itself). Listed here only because it is used and requested frequently, and it should not be difficult to restructure.
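
A minimal sketch of the restructuring, assuming pyUSID's Process class and its usual override points; the exact constructor signature depends on the pyUSID version, and SHOGuess is a hypothetical name::

    from pyUSID import Process

    class SHOGuess(Process):

        def _create_results_datasets(self):
            # allocate the HDF5 results group and datasets up front
            pass

        def _unit_computation(self, *args, **kwargs):
            # compute guesses for the chunk currently in memory, typically
            # by mapping a per-pixel function via parallel_compute
            pass

        def _write_results_chunk(self):
            # flush the results of the current chunk back to the HDF5 file
            pass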

Outreach

  • Look into making notebooks for workshops available through mybinder.org
  • Clear and obvious examples showing what pycroscopy actually does for people
    • Image cleaning
    • Signal Filtering
    • Two more examples
  • Upload clean exports of paper notebooks + add notebooks for new papers + add new papers (Sabine + Liam)
  • Explore Azure Notebooks for live tutorials

Software engineering

  • Move requirements to requirements.txt
  • Profile code to see where things are slow
  • Update the change-log with version numbers / releases instead of pull request numbers
  • Unit tests for basic data science (Cluster, SVD, Decomposition) - see the sketch after this list
  • Examples within docs for popular functions
  • Revisit and address as many pending TODOs as possible
  • ITK for visualization - https://github.com/InsightSoftwareConsortium/itk-jupyter-widgets
  • Look into Tasmanian (mainly modeling) - Miroslav Stoyanov
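
A minimal unittest sketch for the data-science item above, assuming pycroscopy's SVD class; the test file and dataset path are hypothetical::

    import unittest
    import h5py
    from pycroscopy.processing.svd_utils import SVD

    class TestSVD(unittest.TestCase):

        def test_svd_writes_u_s_v(self):
            # hypothetical small USID-formatted test file
            with h5py.File('tests/data/small_usid.h5', mode='r+') as h5_f:
                h5_main = h5_f['Measurement_000/Channel_000/Raw_Data']
                h5_results = SVD(h5_main, num_components=4).compute()
                for name in ('U', 'S', 'V'):
                    self.assertIn(name, h5_results)

    if __name__ == '__main__':
        unittest.main()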

Misc

  • A sister package with the base LabVIEW sub-VIs that enable writing pycroscopy-compatible HDF5 files. The actual acquisition code can be ignored.
  • Consider developing a generic curve-fitting class à la HyperSpy
  • Chris - Demystify analysis / optimize. Use parallel_compute instead of optimize and guess_methods and fit_methods (see the sketch after this list).
  • Consistency in the naming and placement of attributes (channel or measurement group) in all translators - some put attributes at the measurement level, some at the channel level! HyperSpy appears to create groups solely for the purpose of organizing metadata in a tree structure.
  • Batch fitting - need to consider notebooks for batch processing of BE-Line and other BE datasets. This needs some thought, but a basic visualizer that allows selecting a file from a list and plotting the essential graphs is needed.
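
A minimal sketch of the parallel_compute swap mentioned above, assuming pyUSID's parallel_compute, which maps a function over the rows of a 2D array across cores; fit_one_pixel is a hypothetical per-pixel routine::

    import numpy as np
    from pyUSID.processing.comp_utils import parallel_compute

    def fit_one_pixel(spectrum):
        # stand-in for a real per-pixel guess / fit function
        return np.array([spectrum.min(), spectrum.max()])

    raw_2d = np.random.rand(128, 64)  # positions x spectral points
    # returns a list with one result per position (row)
    results = parallel_compute(raw_2d, fit_one_pixel, cores=2)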

Reorganization

  • Reorganize code - this is perhaps the last opportunity for major restructuring and renaming.
  • Subpackages within processing: statistics, image, signal, misc
  • Make room (in terms of organization) for deep learning - implementation will NOT be part of 0.60.0:
    • pycroscopy HDF5 to TFRecords / whatever other frameworks use (see the sketch after this list)
    • Which science-specific functions can be generalized and curated?
  • Usage of the package (e.g. only Clustering + SHO fitting) probably provides clues about how the package should / could be reorganized (by analysis / process). Typically, most Analysis and Process classes have science-specific plotting. Why not place Process- / Analysis-specific plotting / Jupyter functions alongside the Process / Fitter class?
  • Think about whether the rest of the code should be organized by instrument
    • One possible strategy - .core, .process (science-independent), .instrument. For example, px.instrument.AFM.BE would contain translators under a .translators submodule, the two analysis modules and accompanying functions under .analysis, and visualization utilities under a .viz submodule. The problem with this is that users may find it needlessly complicated. Retaining the existing package structure means that all the modalities are mixed in .analysis, .translators, and .viz.
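
A minimal sketch for the TFRecords sub-item above, assuming TensorFlow's TFRecord API; the file names and dataset path are hypothetical::

    import h5py
    import tensorflow as tf

    with h5py.File('measurement.h5', 'r') as h5_f, \
            tf.io.TFRecordWriter('measurement.tfrecord') as writer:
        raw = h5_f['Measurement_000/Channel_000/Raw_Data']
        for spectrum in raw:
            # one serialized Example per position (row of the main dataset)
            feature = {'spectrum': tf.train.Feature(
                float_list=tf.train.FloatList(
                    value=spectrum.astype('float32')))}
            example = tf.train.Example(
                features=tf.train.Features(feature=feature))
            writer.write(example.SerializeToString())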

External user contributions

  • Sabine Neumeyer's cKPFM code
  • Incorporate sliding FFT into pycroscopy - Rama
  • Create an IR analysis notebook - Suhas should have something written in IF Drive
  • Li Xin classification code - Li Xin
  • Ondrej Dyck's atom finding code - written well but needs to work on images with different kinds of atoms
  • Nina Wisinger's processing code (Tselev) - in progress
  • Port everything from IFIM MATLAB -> Python translation exercises
  • Iaroslav Gaponenko's Distort correct