Skip to content

Compiler instrumentation + Fortran module + New tool libraries + NCCL support + NVML support + Python tracing + Hatchet + User Metadata + CUPTI PCSampling

Compare
Choose a tag to compare
@jrmadsen jrmadsen released this 29 Jun 08:42
· 12 commits to master since this release
2bdd28e
  • Numerous stability fixes
  • Fortran module
  • Compiler instrumentation
  • NCCL support
  • timemory-mallocp
  • timemory-ncclp
  • timemory-nvml
  • Python line-by-line tracing
  • I/O {read,write}_{char,bytes}
  • Network stats components
  • libunwind support
  • CMake minimum upgraded to 3.15
  • Type-traits for tree/flat/timeline
  • Hierarchical serialization (hatchet support)
  • Concepts
  • Improved settings
  • Python tracer (line-by-line)
  • CTestNotes support
  • Command-line options for settings
  • Migrated cereal to internal (i.e. cereal:: -> tim::cereal::)
  • Dramatically improved Windows support
  • Improved kokkos support
    • Command-line options
    • Print help
  • XML serialization support
  • Shared caches for components
  • Support for C++17 string_view
  • Python bindings to storage classes
  • Windows support for different CPU timers
  • CUDA Cupti PCSampling support (CUDA v11+)
  • User metadata
  • Sampling support in opaque (i.e. within user-bundles)
  • Static polymorphic base for bundlers
  • Namespace re-organization
  • CUDA compilation with Clang compiler
  • Piecewise installation
  • timem support md5sum hashing of command-line
  • papi_threading setting
  • is_invalid in base_state
  • New operations
    • stack_push
    • stack_pop
    • insert
    • set_depth_change
    • set_is_flat
    • set_is_on_stack
    • set_is_invalid
    • set_iterator
    • get_is_flat
    • get_is_invalid
    • get_is_on_stack
    • get_depth
    • get_storage
    • get_iterator