Skip to content

Releases: NetApp/harvest

Harvest Nightly Release

06 May 11:55
788f634
Compare
Choose a tag to compare
Pre-release

Nightly builds may include bugs and other issues. You might want to use the stable releases instead.

24.02.0

21 Feb 15:24
8f9201c
Compare
Choose a tag to compare

24.02.0 / 2024-02-21 Release

πŸ“Œ Highlights of this major release include:

  • New Datacenter dashboard which contains node health, capacity, performance, storage efficiency, issues, snapshot, power, and temperature details.

  • Harvest includes SnapMirror active sync EMS events with alert rules. Thanks @Nikhita-13 for reporting.

  • Harvest monitors FlexCache performance metrics and includes a new FlexCache dashboard to visualize them. Thanks to @ewilts for raising.

  • Harvest detects HA pair down and sensor failures. These are shown in the Health dashboard. Thanks to @johnwarlick for raising.

  • Harvest monitors MetroCluster diagnostics and shows them in the MetroCluster dashboard. Thanks to @wagneradrian92 for reporting.

  • We improved the performance of all dashboards that include topk queries. Thanks to @mamoep for reporting!

  • We added filter support for the ZapiPerf collector. See filter for more detail. Thanks to @debbrata-netapp for reporting.

  • A bin/harvest grafana customize command that writes the dashboards to the filesystem so other programs can manage them. Thanks to @nicolai-hornung-bl for reporting!

  • We fixed an intermittent latency spike problem that impacted all perf objects. Thanks to @summertony15 and @rodenj1 for reporting this critical issue.

  • ⭐ Several of the existing dashboards include new panels in this release:

    • Node and Aggregate dashboard include volume stats panels. Thanks to @BrendonA667 for raising.
    • SVM dashboard includes volume capacity panels. Thanks to @BrendonA667 for raising.
    • SnapMirror dashboard includes automated_failover and automated_failover_duplex policies.
  • More Harvest dashboard dropdown variables include the All option. Making it easier to get an overview of your environment.

  • All EMS alerts include an impact annotation. Thanks to @divya for raising.

  • 🌾 Harvest includes new templates to collect:

    • Network filesystem (NFS) rewinds performance metrics (rw_ctx). Thanks to @shawnahall71 for raising
    • Network data management protocol (NDMP) session metrics. Thanks to @schumijo for raising.
  • πŸ“• Documentation additions

    • Harvest describe why and how to configure Docker's logging drivers Docker logging configuration Thanks to @madaan for raising.
    • How to create templates that use ONTAP's private CLI details
    • How to create custom Grafana dashboards Steps
    • How to validate your harvest.yml file and share a redacted copy with the Harvest team. Details
    • Harvest describes high-level concepts here Thanks to @norespers for raising.
  • All constituents are disabled by default for workload detail performance templates.

  • The bin/harvest zapi CLI now supports a timeout argument.

  • Harvest performance collectors (ZapiPerf and RestPerf) ask ONTAP for performance counter metadata every 24 hours instead of every 20 minutes. Thanks to BrianMa for raising.

  • The Harvest REST collector's api_time metric now includes the API time for all template endpoints. Thanks to ChristopherWilcox for raising.

Announcements

‼️ IMPORTANT Release 24.02 disables four templates that collected metrics not used in dashboards.
These four templates are disabled by default: ObjectStoreClient, TokenManager, OntapS3SVM, and Vscan.
This change was made to reduce the number of collected metrics.
If you require these templates, you can enable them by uncommenting them in their corresponding default.yaml or by extending the existing object template.

πŸ”Ί IMPORTANT The minimum version of Prometheus required to run Harvest is now 2.33.
Version 2.33 is required to take advantage of Prometheus's @ modifier.
Please upgrade your Prometheus server to at least 2.33 before upgrading Harvest.

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the 'bin/harvest grafana import' CLI, from the Grafana UI, or from the 'Maintenance > Reset Harvest Dashboards' button in NAbox.

Known Issues

  • Harvest does not calculate power metrics for AFF A250 systems. This data is not available from ONTAP via ZAPI or REST.
    See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug
    1473892 for more details.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors
like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18.
The default for TLS client connections was changed to TLS 1.2 in Go 1.18.
Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in
your harvest.yml poller section.
See #1007 for more details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@shawnahall71, @pilot7777, @ben, @madaan, @johnwarlick, @jfong5040, @santosh725, @summertony15, @jmg011, @cheese1, @mamoep, @Falcon667, @dess, @debbrata-netapp, @ewilts,
@Nikhita-13, @norespers, @nicolai-hornung-bl, @BrendonA667, @schumijo, @divya, @joshuacook-tamu, @wagneradrian92, @george-strother

🌱 This release includes 26 features, 24 bug fixes, 20 documentation, 3 styling, 5 refactoring, 11 miscellaneous, and 12 ci pull requests.

πŸš€ Features

  • Include Start Time, Exported Metrics, And Poll Duration In Collector logs (#2493)
  • Adding Rw_ctx Zapiperf Object Template (#2494)
  • Change Pollcounter Schedule To 24H (#2499)
  • Add Ha Down And Sensor Issues In Health Dashboard (#2519)
  • Adding Ndmp Session Rest Template (#2531)
  • Use Modifier For Topk To Improve Svm Dashboard Performance (#2553)
  • Add Timeout For Zapi Cli (#2566)
  • Restperf Disk Plugin Should Support Metric Customization (#2573)
  • Add Filter Support For Zapiperf Collector (#2575)
  • FlexCache Monitoring (#2583)
  • Supporting Automated_failover, Automated_failover_duplex Policy In Sm (#2584)
  • Disabled The Templates Whose All Metrics Are Not Consumed In Dashboards (#2587)
  • Harvest Should Include Snapmirror Active Sync Ems Events (#2588)
  • Use Modifier For Topk To Improve Dashboard Performance (#2590)
  • Harvest Should Include A Snapmirror Active Sync Template (#2596)
  • Disable Constituents By Default For Workload Detail Performance Templates (#2598)
  • Adding Template For Metrocluster Diagnostics Check (#2601)
  • Adding Per Volume Panels In Svm Dashboard (#2602)
  • Add Grafana Customize Command (#2619)
  • Add Volume Stats To Node And Aggregate Dashboard (#2627)
  • Ems Alerts Should Include An Impact Annotation (#2631)
  • Improving Debug Log Clarity And Reducing Noise (#2637)
  • Datacenter Dashboard (#2650)
  • Harvest Dashboards Should Include An All Option (#2661)
  • Percent Unit Panels Should Use Decimal Points (#2663)
  • Change Stat Panel For Uptime,Power Status,Fan Status To Table In Node Dashboard (#2668)

πŸ› Bug Fixes

  • Handled Missing Uuid In Volume For Change_log (#2478)
  • Remove Docs From Deb Binary (#2489)
  • Parsed Logger Changes (#2490)
  • Array Metrics Should Have Correct Base Label In Zapiperf (#2496)
  • Harvest Should Collect Luns In Qtress (#2502)
  • Grafana Export Should Set Correct Permissions (#2505)
  • Begin Log For Pollcounter And Pollinstance Should Be In Ms ([#2509](https://githu...
Read more

23.11.0

13 Nov 15:18
166b026
Compare
Choose a tag to compare

23.11.0 / 2023-11-13 Release

πŸ“Œ Highlights of this major release include:

  • New FlexGroup dashboard that includes FlexGroup constituents. Thanks to @Sandromuc and @ewilts for raising.

  • Harvest ChangeLog plugin to detect and monitor changes related to object creation, modification, and deletion.

  • We improved how Harvest calculates power. As a result, you may notice a decrease in the reported power metrics compared to previous versions. Details here. Thanks to Evan Lee for reporting!

  • Added conf_path variable for specifying the search path of Harvest templates.

  • πŸ“¦ Streamlined the Harvest container installation process by eliminating the need to download a tar file. Running Harvest in a container is now simpler and more convenient.

  • ⭐ Several of the existing dashboards include new panels in this release:

    • Aggregate and Volume dashboard includes performance and capacity tier data. Thanks to @ewilts for raising.
    • Workload dashboard includes QoS fixed Utilization % panels. Thanks to @faguayot for raising.
    • Disk Dashboard features performance panels at the disk raid-group level. Thanks to @kinderr95 for raising.
  • 🌾 Harvest includes new templates to collect:

    • Cloud target metrics. Thanks to @mamoep for raising
    • CIFS Share metrics. Thanks to @s-kuchi for raising
    • IWarp metrics are included in RestPerf
    • object_store_server metrics are included in RestPerf
    • SMB2 metrics are included in RestPerf
  • πŸ“• Documentation additions

    • Enhanced Quickstart guide for Harvest
    • NABox logs collection guide
    • Document poller ca_cert property. Thanks to Marvin Montanus for reporting!
    • Describe how Harvest calculates power. Thanks to Evan Lee for reporting!
    • Details about hidden_fields and filter for the Rest Collector. Thanks to Johnathan Warlick for raising!
  • Enhanced the Volume dashboard to include clone information.

  • ⚑ Optimized the Harvest binaries, significantly reducing their size.

  • The Metadata dashboard works inside container deployments.

  • The FabricPool panels in the Volume dashboard now support FlexGroup volumes. Thanks to @sriniji for reporting.

  • Large harvest.yml files can be refactoring into smaller ones. Thanks to @llelik and @Pengng88 for raising.

  • πŸ’‘ Added help text about metrics to more Harvest dashboard panels.

Announcements

‼️ IMPORTANT Due to ONTAP bug 1585893 the Harvest team recommends using ZapiPerf instead of RestPerf when collecting performance metrics. The RestPerf collector can be used once you upgrade your cluster to a version of ONTAP with the fix. Details in 1585893.

‼️ IMPORTANT Release 23.11 disables the CIFSSession templates by default. This change was made to prevent the generation of a large number of metrics. If you require these templates, you can enable them. Please be aware that enabling them may result in a significant increase in metric collection time, Harvest memory footprint, and Prometheus used disk space. These metrics are utilized in the SMB2 dashboard.

‼️ IMPORTANT Release 23.11 has updated its power metric calculation algorithm. As a result, you may notice a decrease in the reported power metrics compared to previous versions. To collect these metrics, Rest API permissions are required. For detailed information on the power algorithm, please refer to the power algorithm documentation.

‼️ IMPORTANT NetApp moved their communities from Slack to Discord, please join us there!

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the bin/harvest grafana import CLI, from the Grafana UI, or from the Maintenance > Reset Harvest Dashboards button in NAbox.

Known Issues

  • Some AFF A250 systems do not report power metrics. See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug
    1473892 for more details.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@Garydep, @MrObvious, @Pengng88, @Sandromuc, @ewilts, @faguayot, @jmg011, @kinderr95, @llelik, @mamoep, @rodenj1, @s-kuchi, @shawnahall71, @slater0013, @sriniji, @statdigger, @wyahn1219, AlessandroN, Dave, Diane, Evan Lee, Francesco, Heaven7, Johnathan Warlick, Madaan, Martijn Moret, Marvin Montanus, NicoSeiberth, RBrown, TonyHsieh, Watson9121, dbakerletn, imthenightbird, roller, twodot0h, tymercer

🌱 This release includes 38 features, 26 bug fixes, 24 documentation, 5 performance, 2 refactoring, 12 miscellaneous, and 7 ci pull requests.

πŸš€ Features

  • Change Log Detection In Harvest (#2178)
  • Remove Daemon Dependency (#2195)
  • Enable More Golanglint Linters (#2313)
  • Gcc Is Not Required To Build Harvest (#2322)
  • Ontap Permission Errors Should Be Logged As Errors (#2326)
  • Harvest Should Load Templates From A Set Of Conf Directories (#2329)
  • Ontap Power Calculation For Embedded Shelf (#2333)
  • Enable More Golanglint Linters (#2334)
  • Harvest Auto-Support Should Include Instance Count In Collector Section (#2337)
  • Set Allvalue To Null When Svm Regex Is Applied (#2340)
  • Add Parity For String Types Between Restperf And Zapiperf (#2342)
  • Tiering Data Changes For Volume - Template Change (#2343)
  • Docker Workflow Doesn't Need Tar Download (#2354)
  • Enable Ports By Default In Docker Generate (#2360)
  • Support Comma Separated Aggrs In Perf Metrics (#2376)
  • Harvest Should Support Multiple Poller Files To Allow Refactori… (#2388)
  • Adding Iwarp Restperf Template (#2390)
  • Adding New Panels In Disk Dashboard (#2391)
  • Harvest Should Load Templates From A Set Of Conf Directories (#2394)
  • Add Api To Rest Error Log (#2401)
  • Add Clone Info To Volume Dashboard (#2402)
  • Cifs Share Templates (#2405)
  • Support Flexgroup Constituents In Template (#2410)
  • Add Flexgroup To Fabricpool Panels (#2419)
  • Smb2 Restperf Counters (#2420)
  • Adding Fc Rest Template For Fibre Channel Switch (#2424)
  • Metric Doc Needs To Handle Templates With Same Object Names (#2426)
  • Antiransomwarestate Label Should Be Exported (#2432)
  • Metadata Dashboard Should Work With Containers And Remove System Resources Panel (#2433)
  • Adding Restperf Object_store_server Template (#2435)
  • Update Ci To Use Docker Run And Update Permissions (#2436)
  • Enable More Golanglint Linters (#2439)
  • Qos Fixed Utilization % Panels (#2445)
  • Description Fetched From Ontap Docs Via Cli (#2454)
  • Disable Cifssession Template (#2455)
  • Add Labels Defined In Harvest Config To Metadata Metrics (#2456)
  • Add Link_up Counter For Fcp (#2464)
  • Implementing Support For Randomized Start Times In Tasks (#2465)

πŸ› Bug Fixes

  • Qos Policy Not Updated In Workload Counters ([#2318](#23...
Read more

23.08.0

21 Aug 13:02
874a1f8
Compare
Choose a tag to compare

23.08.0 / 2023-08-21 Release

πŸ“Œ Highlights of this major release include:

  • Harvest Security dashboard highlights compliance using NetApp's Security hardening guide for ONTAP

  • Harvest's credential script supports ONTAP daily credential rotation. Thanks to @mamoep for raising.

  • 🎩 Harvest makes it easy to run with both the ZAPI and REST collectors at the same time. Overlapping resources are deduplicated and only exported once. Harvest will automatically upgrade ZAPI conversations to REST when ZAPIs are suspended or disabled.

  • πŸ’Ž Updated workload dashboard now includes Service Center, Latency Breakdown, and 50 panels

  • πŸ’Ž Cluster dashboard updated to work with FSx. Some panels are blank because FSx does not have that data.

  • πŸ“£ The Harvest team published a couple of screencasts about:

  • ⭐ Several of the existing dashboards include new panels in this release:

    • Aggregate dashboard includes busy volume panels
    • SVM dashboard includes per NFS latency heatmaps. Thanks to @rbrownATnetapp for raising.
    • Volume dashboard includes top resources by other IOPs panel and junction paths. Thanks to @tsohst for raising.
  • All Harvest dashboard tables include column filters

  • Harvest dashboards use color to highlight latency and busy threshold breaches

  • Harvest's Prometheus exporter supports TLS

  • 🌾 Harvest includes new templates to collect:

    • Iwarp metrics
    • FCVI metrics
    • Per volume NFS metrics
    • Volume clone metrics
    • QoS workload policy metrics
    • NVME/TCP and NVME/RoCE metrics
    • Flashpool metrics are included in RestPerf. Thanks to @lobster1860 for raising
  • πŸ“• Documentation additions

    • Move more documentation from GitHub to Harvest documentation site
    • Clarify how to tell Harvest to continue using the ZAPI protocol
    • Clarify generic vs custom plugins. Thanks to GregS for raising
    • Clarify which version of Go is required to build Harvest. Thanks to MikeK for raising
    • Clarify how to prepare ONTAP cDOT clusters for Harvest data collection
    • EMS documentation should point to Harvest documentation site. Thanks to @cwaltham for raising
    • Clarify how to gather log files on all platforms
    • Explain how to use the --labels option of bin/harvest grafana. Thanks to @slater0013 for raising
    • Describe how to run docker compose generate command without required Harvest binaries
  • The Harvest doctor command validates collector names listed in your harvest.yml file

  • An earlier version of Harvest collected cloud store information via REST. This release adds the same for ZAPI

  • When ONTAP resources are missing, Harvest tries to collect them every hour. Earlier versions of Harvest waited 24 hours before retrying, which often caused metrics to be missing after a cluster upgrade. Thanks to @Falcon667 for raising

  • Earlier versions of Harvest created world writable auto-support files. These files are now only read/writeable by the current user. Thanks to Bunnygirl for raising

  • bin/harvest import should work with Grafana 10. Thanks to @wooyoungAhn for raising

Announcements

‼️ IMPORTANT 23.08 fixes a REST collector bug that caused partial data collection when ONTAP paginated results. See #2109 for details.

‼️ IMPORTANT Release 23.08 disables the NetConnections and NFSClients templates by default. You can enable them if needed. These templates were disabled because several customers reported that these templates created millions of metrics. None of these metrics are used in Harvest dashboards.

‼️ IMPORTANT Release 23.08 changes how Harvest monitors workloads. For detailed information, please refer to the discussion #2265.

πŸ’‘ The Compliance dashboard was removed after its panels were moved to the Security dashboard.

πŸ‘€ Ambient temperature metric may experience an increase due to issue #2259

‼️ IMPORTANT NetApp moved their communities from Slack to Discord, please join us there!

‼️ IMPORTANT If using Docker Compose and you want to keep your historical Prometheus data, please
read how to migrate your Prometheus volume

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the bin/harvest grafana import CLI, from the Grafana UI, or from the Maintenance > Reset Harvest Dashboards button in NAbox.

Known Issues

  • Some AFF A250 systems do not report power metrics. See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug
    1473892 for more details.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@7840vz, @DAx-cGn, @Falcon667, @Hedius, @LukaszWasko, @MrObvious, @ReneMeier, @Sawall10, @T1r0l, @XDavidT, @amd-eulee, @aticatac, @chadpruden, @cwaltham, @cygio, @ddhti, @debert-ntap, @demalik, @electrocreative, @elsgaard, @ev1963, @faguayot, @iStep2Step, @jgasher, @jmg011, @lobster1860, @mamoep, @matejzero, @matthieu-sudo, @merdos, @pilot7777, @rbrownATnetapp, @rodenj1, @slater0013, @swordfish291, @tsohst, @wooyoungAhn, Alessandro.Nuzzo, Ed Wilts, GregS, Imthenightbird, KlausHub, MeghanaD, MikeK, Paul P2, Rusty Brown, Shubham Mer, Tudor Pascu, Watson9121, jf38800, jfong, lorenzoc, rcl23, roller, scrhobbs, troysmuller, twodot0h

🌱 This release includes 42 features, 40 bug fixes, 20 documentation, 2 performance, 4 testing, 1 styling, 9 refactoring, 20 miscellaneous, and 12 ci pull requests.

πŸš€ Features

  • Harvest Should Collect Iwarp Counters (#2071)
  • Update Visitpanels To Be Recursive (#2085)
  • Add Table Column Filter For Dashboards (#2088)
  • Update Lagtime Based On Lasttransfersize (#2091)
  • Harvest Should Add Grafana Import Rewrite Svm Filtering For Multi-Tenant Support (#2092)
  • Fetch Cloud_store Info In Zapi Via Plugin (#2094)
  • Collection Of Other Counters For Fcvi Perf Object (#2096)
  • Add Nfs Io Types At The Volume Level (#2098)
  • Add System Defined Workload Collection (#2099)
  • Add Workload Panels In Workload Dashboard (#2100)
  • Add Volume Clone Info In Rest (#2102)
  • Added Volume Panels In Aggr Dashboard (#2104)
  • Workload Policy Iops Metrics (#2111)
  • Autoresolve Ems Would Export Metric Value As 0 And Autoresolve=True Label (#2120)
  • Support Type Label For Volume For Backward Compatibility (#2132)
  • Volume Clone Info For Zapi (#2140)
  • Harvest Should Include Numpollers And Rss In Autosupport (#2143)
  • Colors In Grafana Dashboards To Highlight Warning, Critical Severity (#2147)
  • Security Hardening Guide (#2150)
  • Harvest Prometheus Exporter Should Support Tls (#2153)
  • Latency Units Should Be In Microseconds In Harvest Dashboard (#2156)
  • Simplify Rest Auto-Upgrade (#2167)
  • When Using A Credential Script, Re-Auth On 401S (#2180)
  • Upgrade Zapi Conversations To Rest When Zapis Are Suspended Or … (#2200)
  • When Using A Credential Script, Re-Auth On 401S (#2203)
  • Merge Compliance And Security Dashboard + Added Arw Fields (#2207)
  • Supporting Topk In S3 Dashboard (#2208)
  • Aff250 Power Calculation (#2211)
  • Use Single Go Build Command To Build Harvest And Poller Binaries ([#2221](https://github.com/NetApp/harvest...
Read more

23.05.0

03 May 12:42
6f74c7a
Compare
Choose a tag to compare

23.05.0 / 2023-05-03

πŸ“Œ Highlights of this major release include:

  • πŸ’Ž Seven new dashboards:

    • StorageGRID and ONTAP fabric pool
    • Health
    • S3 object storage
    • External service operations
    • Namespace
    • SMB
    • Workloads
  • ⭐ Several of the existing dashboards include new panels in this release:

    • Qtree dashboard includes topK qtrees by disk-used growth
    • StorageGRID Overview dashboard includes traffic classification panels
    • Network dashboard includes net routes
    • Average CPU utilization and CPU busy are included in the cDOT, Cluster, Node, and Metrocluster dashboards
    • SVM dashboard includes LIF counters and the NFS panels filter graphs by NFS version
    • Volume dashboard includes efficiency statistics
    • Aggregate dashboard includes the amount of free space
    • Compliance dashboard only reports on data SVMs
  • πŸ” Harvest can fetch cluster credentials via a credential script. Thanks to Ed Wilts for raising.

  • 🌾 Harvest includes new templates to collect:

    • IP routes. Thanks jfong for contributing!
    • QoS fixed and adaptive policy groups. Thanks @faguayot for raising!
    • Cloud targets and storage
    • Export rules
    • Namespaces
    • CIFS clients
    • LIF counters
    • Volume efficiency stats
  • Harvest containers are published to GitHub's container registry in addition to DockerHub and cr.netapp.io.
    If you're using cr.netapp.io, we encourage you to switch to ghcr.io or DockerHub. In 2024, we will stop publishing to cr.netapp.io

  • Harvest uses a distroless image as its base now - reducing the size of the container and reducing the attack surface

  • Harvest collects 38 additional EMS events and alert rules in this release

  • Harvest EMS alert rules were updated to include better label names and align their severity with Prometheus best practices. Thanks to @7840vz for contributing this feature!

  • The bin/harvest doctor tool validates your custom.yaml template files, checking them for errors.

  • πŸ“• Documentation additions

  • 🎩 Harvest makes it easy to run with both the ZAPI and REST collectors at the same time. Overlapping resources are deduplicated and only published to Prometheus once. This was the final piece in our journey to REST. See rest-strategy.md if you are interested in the details.

Announcements

IMPORTANT The volume_aggr_labels metric is being deprecated in the 23.05 release and will be removed in the 23.08 release of Harvest (#1966) volume_aggr_labels is redundant and the same labels are already available via volume_labels.

IMPORTANT To reduce image and download size, several tools were combined in 23.05. The following binaries are no longer included: bin/grafana, bin/rest, bin/zapi. Use bin/harvest grafana, bin/harvest rest, and bin/harvest zapi instead.

‼️ IMPORTANT NetApp moved their communities from Slack to Discord, please join us there!

‼️ IMPORTANT If using Docker Compose and you want to keep your historical Prometheus data, please
read how to migrate your Prometheus volume

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the bin/harvest grafana import CLI, from the Grafana UI, or from the Maintenance > Reset Harvest Dashboards button in NAbox.

Known Issues

  • Harvest does not calculate power metrics for AFF A250 systems. This data is not available from ONTAP via ZAPI or REST.
    See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug
    1473892 for more details.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@7840vz, @DAx-cGn, @Falcon667, @Hedius, @LukaszWasko, @MrObvious, @ReneMeier, @Sawall10, @T1r0l, @XDavidT, @aticatac, @chadpruden, @cygio, @ddhti, @debert-ntap, @demalik, @electrocreative, @elsgaard, @ev1963, @faguayot, @iStep2Step, @jgasher, @jmg011, @mamoep, @matejzero, @matthieu-sudo, @merdos, @pilot7777, @rodenj1, Alessandro.Nuzzo, Ed Wilts, Imthenightbird, KlausHub, MeghanaD, Paul P2, Rusty Brown, Shubham Mer, Tudor Pascu, Watson9121, jf38800, jfong, rcl23, troysmuller, twodot0h

🌱 This release includes 61 features, 49 bug fixes, 22 documentation, 2 testing, 8 refactoring, 25 miscellaneous, and 32 ci pull requests.

πŸš€ Features

  • Pollers Should Allow Customers To Opt Out Of Rest Upgrade (#1744)
  • Restperf Vscan Counters (#1751)
  • Smb2 Dashboard (#1754)
  • Add Object Count To S3 Metrics (#1759)
  • Enable Golanglint "Unparam" Linter (#1769)
  • Dependabot Should Bump Dependencies (#1777)
  • Print Missing Rest Metrics In Metric Generate Command (#1783)
  • Add Datacenter To Metadata Exporter_time Metrics (#1789)
  • Percentage Panels Should Clamp Min/Max To 0/100% (#1790)
  • Qtree Dashboard Should Include Topk Qtrees By Disk Used Growth (#1792)
  • Harvest Should Collect Ip Routes (#1801)
  • Include Aggregate Encryption Information In Rest/Zapi Templates (#1803)
  • Add Encrypted Field To Aggregate Dashboard (#1804)
  • Harvest Should Include Sg Traffic Classification Panels (#1807)
  • Harvest Should Fetch Auth Via Script (#1819)
  • Delay Center Dashboard (#1824)
  • Publish Harvest Images To Github Container Registry (#1827)
  • Harvest Should Default To Pulling Images From Github Container … (#1830)
  • Harvest Should Collect Qos Policy Groups (#1831)
  • Ontap S3 Dashboard - Config Metrics (#1833)
  • Harvest Should Collect Cloud Targets (#1836)
  • Add Routes To Network Dashboard (#1840)
  • Harvest Should Collect Export Rules (#1843)
  • Workload Dashboard (#1846)
  • Harvest Should Collect Adaptive Qos Policy Groups (#1847)
  • Harvest Should Turn Dashboard Refresh Off (#1849)
  • Namespace Dashboard (#1850)
  • Create Release Issue Template (#1856)
  • Enable Rest Ci Failures (#1858)
  • Bin/Rest Should Be Able To Query All Clusters (#1866)
  • Go Test Should Detect Races And Order Dependent Tests (#1868)
  • Add Average Cpu Utilization And Cpu Busy In Harvest Dashboards (#1872)
  • Harvest Should Use A Distroless Image As Its Base Image Instead… (#1877)
  • Cluster Health Dashboard ([#1881](https...
Read more

23.02.0

21 Feb 14:30
Compare
Choose a tag to compare

23.02.0 / 2023-02-21

πŸ“Œ Highlights of this major release include:

  • ✨ Harvest includes a new file system analytics (FSA) dashboard with directory growth, top directories per volume, and volume usage statistics.

  • Harvest includes a new StorageGRID overview dashboard with performance, storage, information lifecycle management, and node panels. We're collecting suggestions on which StorageGRID dashboards you'd like to see next in issue #1420.

  • πŸ’‘ Power dashboard includes new panels for total power by aggregate disk type, average power per used TB, average IOPs/Watt, total power by aggregate, and information on sensor problems.

  • 🎩 Harvest makes it easy to run with both the ZAPI and REST collectors at the same time. Overlapping resources are deduplicated and only published to Prometheus once. This was the final piece in our journey to REST. See rest-strategy.md if you are interested in the details.

  • πŸ“• We made lots of improvements to Harvest's new documentation site this release including one of the most requested features - a list of Harvest metrics and their corresponding ONTAP ZAPI/REST API mappings. πŸ“ Check it out

  • πŸ’Ž New dashboards and improvements

    • A new file system analytics (FSA) dashboard with directory growth, top directories per volume, and volume usage statistics
    • A new StorageGRID overview dashboard with performance, storage, information lifecycle management, and node panels
    • Power dashboard includes new panels for total power by aggregate disk type, average power per used TB, average IOPs/Watt, total power by aggregate, and information on sensor problems.
    • Disk dashboard shows which node/controller a disk belongs too
    • SVM dashboard shows topK resources in panel drill downs
    • SnapMirror dashboard includes transfer duration, lag time and transfer data panels in addition to new source and destination volume variables to make it easier to understand SnapMirror relationships
    • Aggregate dashboard includes a new flash pool drill down with five new panels
    • Aggregate dashboard includes four new panels showing volume statistics broken down by flexvol/flexgroup space per aggregate
    • SVM dashboard includes NFSv3 latency heatmap panels
    • Node dashboard latency panels updated to use weighted average, bringing them inline with ActiveIQ
    • Volume dashboard includes new inode usage panels
  • Harvest includes a new command bin/harvest grafana metrics which shows which metrics each dashboard uses

Announcements

‼️ IMPORTANT NetApp moved their communities from Slack to Discord, please
join us there!

‼️ IMPORTANT If using Docker Compose and you want to keep your historical Prometheus data, please
read how to migrate your Prometheus volume

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and
fixes. You can import them via the bin/harvest/grafana import CLI, from the Grafana UI, or from
the Maintenance > Reset Harvest Dashboards button in NAbox.

🌻 In the 22.11.0 release notes, we announced that we would be removing quota metrics prefixed with qtree.
Several of you asked us to leave them. πŸ‘ We will continue publishing them as-is.

Known Issues

  • Harvest does not calculate power metrics for AFF A250 systems. This data is not available from ONTAP via ZAPI or REST.
    See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug
    1473892 for more details.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors
like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@Falcon667, @MrObvious, @ReneMeier, @Sawall10, @T1r0l, @aticatac, @chadpruden, @demalik, @electrocreative, @ev1963, @faguayot, @iStep2Step, @jgasher, @jmg011, @mamoep, @matejzero, @matthieu-sudo, @merdos, @rodenj1, Ed Wilts, KlausHub, MeghanaD, Paul P2, Rusty Brown, Shubham Mer, Tudor Pascu, jf38800, jfong, rcl23, troysmuller, twodot0h

🌱 This release includes 43 features, 43 bug fixes, 19 documentation, 2 testing, 1 styling, 5 miscellaneous, and 7 ci pull requests.

πŸš€ Features

  • Add Information To Which Node/Controller A Disk Belongs (#1542)
  • Remove Pass Slice From Matrix Data Structure (#1553)
  • Ensure Dashboards Have Only One Expanded Section (#1554)
  • Plugins Can Use Raw Or Display Metric In Calculations (#1567)
  • Perf Collector Delta Calculation Handling (#1571)
  • Added Dashboard Tests For Legends Details (#1576)
  • Add Bin/Grafana Metrics To Print Which Metrics Each Dashboard… (#1578)
  • Restperf Svm_vscan Template (#1590)
  • Simplify Metrics Storage (#1591)
  • Include Inodes File Usage In Volume Dashboard (#1593)
  • Handle Record Values In Metric Calculation (#1594)
  • Refractor Matrix (#1595)
  • Topk Support In Svm Dashboard - 1 (#1608)
  • Topk Support In Svm Dashboard - 2 (#1609)
  • Topk Support In Svm Dashboard - 3 (#1611)
  • Add Failed Sensors To Power Dashboard (#1621)
  • Include Storagegrid Dashboard In Docker Compose (#1631)
  • Include Storagegrid Dashboard In Docker Compose (#1632)
  • Honor Harvest_no_upgrade Envvar When Zapi Exist (#1636)
  • Harvest Metrics Document (#1641)
  • Added 3 Panels Supported With Relationshipid Data Link (#1642)
  • Adding Flashpool Drilldown Panels In Aggr Dashboard (#1649)
  • List Docker Tags On Cr.netapp.io (#1656)
  • Fsa (#1661)
  • Move Shelf Power To Disk Perf Template (#1665)
  • Aggregate Power For Zapi Collector (#1671)
  • Weighted Avg Support In Aggregator Plugin (#1672)
  • Add Activity To Panel Names (#1675)
  • Add Storagegrid Overview Dashboard (#1677)
  • Calculate Power By Disk Type (#1681)
  • Support Aggr Filter For Flexgroup Volumes (#1691)
  • Calculate Aggr Power Rest Support (#1692)
  • Support Aggr Filter Chnages In Zapiperf (#1695)
  • Calculate Power Per Tb And Watt (#1698)
  • Add Nfsv3 Latency Heatmap (#1699)
  • Add Fsa Full Form To Dashboard Name (#1700)
  • Add Ca Certificate Support For Rest Client (#1705)
  • Support Node Aggregation For Flexgroup Also (#1706)
  • Cluster Dashboard Panel Width Ux Changes (#1723)
  • Include Sg Cluster In Panels (#1725)
  • Shelf Power Panel Alignment Issue (#1728)
  • Topk Panels Should Use Topresources Var In Their Titles (#1733)
  • Pollers Should Allow Customers To Opt Out Of Rest Upgrade (#1744) (#1747)

πŸ› Bug Fixes

Read more

22.11.1

08 Dec 15:25
Compare
Choose a tag to compare

22.11.1 / 2022-12-08

πŸ“Œ This release is the same as 22.11.0 with a fix that makes the REST Perf collector work with the just released ONTAP 9.12.1+ and a few other minor fixes. If you are not monitoring ONTAP 9.12.1, you can ignore this release.

Full Changelog: v22.11.0...v22.11.1

22.11.0

21 Nov 18:36
cd81cbc
Compare
Choose a tag to compare

22.11.0 / 2022-11-21

πŸ“Œ Highlights of this major release include:

  • ✨ Harvest now includes a StorageGRID collector and a Tenant/Buckets dashboard. We're just getting started with StorageGRID dashboards. Please give the collector a try, and let us know which StorageGRID dashboards you'd like to see next.

  • 🎩 The REST collectors are ready! We recommend using them for ONTAP versions 9.12.1 and higher. Today, Harvest collects 1,546 metrics via ZAPI. Harvest includes a full set of REST templates that export identical metrics. All 1,546 metrics are available via Harvest's REST templates and no changes to dashboards or downstream metric-consumers is required. πŸŽ‰ More details on Harvest's REST strategy.

  • πŸ“• Harvest has a new documentation site! This consolidates Harvest documentation into one place and will make it easier to find what you need. Stay tuned for more updates here.

  • πŸ’Ž New and improved dashboards

    • cDOT, high-level cluster overview dashboard
    • Headroom dashboard
    • Quota dashboard
    • Snapmirror dashboard shows source and destination side of relationship
    • NFS clients dashboard
    • Fabric Pool panels are now included in Volume dashboard
    • Tags are included for all default dashboards, making it easier to find what you need
    • Additional throughput, ops, and utilization panels were added to the Aggregate, Disk, and Clusters dashboards
    • Harvest dashboards updated to enable multi-select variables, shared crosshairs, better top n resources support, and all variables are sorted by default.
  • πŸ”’ Harvest code is checked for vulnerabilities on every commit using Go's vulnerability management scanner.

  • Harvest collects additional metrics in this release

    • ONTAP S3 server config metrics
    • User defined volume workload
    • Active network connections
    • NFS connected clients
    • Network ports
    • Netstat packet loss
  • Harvest now converts ONTAP histograms to Prometheus histograms, making it possible to visualize metrics as heatmaps in Grafana

Announcements

‼️ IMPORTANT NetApp moved their communities from Slack to Discord, please join us there!

πŸ’£ Deprecation: Earlier versions of Harvest published quota metrics prefixed with qtree. Harvest release 22.11 deprecates the quota metrics prefixed with qtree and instead publishes quota metrics prefixed with quota. All dashboards have been updated. If you are consuming these metrics outside the default dashboards, please change to quota prefixed metrics. Harvest release 23.02 will remove the deprecated quota metrics prefixed with qtree.

‼️ IMPORTANT If using Docker Compose and you want to keep your historical Prometheus data, please read how to migrate your Prometheus volume

πŸ’‘ IMPORTANT After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the bin/harvest/grafana import CLI, from the Grafana UI, or from the Maintenance > Reset Harvest Dashboards button in NAbox.

Known Issues

  • Harvest does not calculate power metrics for AFF A250 systems. This data is not available from ONTAP via ZAPI or REST. See ONTAP bug 1511476 for more details.

  • ONTAP does not include REST metrics for offbox_vscan_server and offbox_vscan until ONTAP 9.13.1. See ONTAP bug 1473892 for more details.

  • Podman is unable to pull from NetApp's container registry cr.netapp.io. Until issue is resolved, Podman users can pull from a separate proxy like this podman pull netappdownloads.jfrog.io/oss-docker-harvest-production/harvest:latest.

  • 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

  • The Unix collector is unable to monitor pollers running in containers. See #249 for details.

Thanks to all the awesome contributors

🀘 Thanks to all the people who've opened issues, asked questions on Discord, and contributed code or dashboards
this release:

@Falcon667, @MrObvious, @ReneMeier, @Sawall10, @T1r0l, @chadpruden, @demalik, @electrocreative, @ev1963, @faguayot, @iStep2Step, @jgasher, @jmg011, @mamoep, @matthieu-sudo, @merdos, @rodenj1, Ed Wilts, KlausHub, MeghanaD, Paul P2, Rusty Brown, Shubham Mer, jf38800, rcl23, troysmuller

🌱 This release includes 59 features, 90 bug fixes, 21 documentation, 4 testing, 2 styling, 6 refactoring, 2 miscellaneous, and 6 ci commits.

πŸš€ Features

  • Enable Multi Select By Default (#1213)
  • Merge Release 22.08 To Main (#1218)
  • Add Avg Cifs Latency To Svm Dashboard Graph Panel (#1221)
  • Network Port Templates (#1231)
  • Add Node Cpu Busy To Cluster Dashboard (#1243)
  • Improve Poller Startup Logging (#1254)
  • Add Net Connections Template For Rest Collector (#1257)
  • Upgrade Zapi Collector To Rest When The Ontap Version Is >= 9.12.1 (#1261)
  • Run Govulncheck On Make Dev (#1273)
  • Nfsv42 Restperf Templates (#1275)
  • Enable User Defined Volume Workload (#1276)
  • Prometheus Exporter Should Log Address And Port (#1279)
  • Ensure Dashboard Units Align With Ontap's Units (#1280)
  • Panels Should Connect Null Values (#1281)
  • Harvest Should Collect Ontap S3 Server Metrics (#1285)
  • Bin/Zapi Show Counters Should Print Xml Results To Make Parsi… (#1286)
  • Harvest Should Collect Ontap S3 Server Config Metrics (#1287)
  • Harvest Should Publish Cooked Zero Performance Metrics (#1292)
  • Add Grafana Tags On Default Dashboards (#1293)
  • Add Harvest Tags (#1294)
  • Rest Nfs Connections Dashboard (#1297)
  • Cmd Line Objects And Collectors Override Defaults (#1300)
  • Harvest Should Replace Topk With Topk Range In All Dashboards Part 1 (#1301)
  • Harvest Should Replace Topk With Topk Range In All Dashboards Part 2 (#1302)
  • Harvest Should Replace Topk With Topk Range In All Dashboards Part 3 (#1304)
  • Snapmirror From Source Side [Zapi Changes] (#1307)
  • Mcc Plex Panel Fix (#1310)
  • Add Support For Qos Min And Cp In Harvest (#1316)
  • Add Available Ops To Headroom Dashboard (#1317)
  • Added Panels In Cluster, Disk For 1.6 Parity (#1320)
  • Add Storagegrid Collector And Dashboard (#1322)
  • Export Ontap Histograms As Prometheus Histograms (#1326)
  • Solution Based Cdot Dashboard (#1336)
  • Cluster Var Changed To Source_cluster In Snapmirror Dashboard (#1337)
  • Remove Pollinstance From Zapi Collector (#1338)
  • Reduce Memory Footprint Of Set (#1339)
  • Quota Metric Renaming (#1345)
  • Collectors Should Log Polldata, Plugin Times, And Metadata (#1347)
  • Export Ontap Histograms As Prometheus Histograms (#1349)
  • Fabricpool Panels - Parity With 1.6 (#1352)
  • All Dashboards Should Default To Shared Crosshair (#1359)
  • All Dashboards Should Use Multi-Select Dropdowns For Each Variable (#1363)
  • Perf Collector Unit Test Cases (#1373)
  • Remove Metric Labels From Shelf Sensor Plugins ([#1378](https:...
Read more

22.08.0

19 Aug 13:43
93db10a
Compare
Choose a tag to compare

22.08.0 / 2022-08-19

πŸš€ Highlights of this major release include:

  • πŸŽ‡ an ONTAP event management system (EMS) events collector with 64 events out-of-the-box

  • Two new dashboards added in this release

    • Headroom dashboard
    • Quota dashboard
  • We've made lots of improvements to the REST Perf collector. The REST Perf collector should be considered early-access as we continue to improve it. This feature requires ONTAP versions 9.11.1 and higher.

  • New max plugin that creates new metrics from the maximum of existing metrics by label.

  • New compute_metric plugin that creates new metrics by combining existing metrics with mathematical operations.

  • 48 feature, 45 bug fixes, and 11 documentation commits this release

IMPORTANT ‼️ NetApp is moving their communities from Slack to NetApp's Discord with a plan to lock the Slack channel at the end of August. Please join us on Discord!

IMPORTANT ‼️ Prometheus version 2.26 or higher is required for the EMS Collector.

IMPORTANT ‼️ After upgrade, don't forget to re-import your dashboards, so you get all the new enhancements and fixes. You can import them via the bin/harvest/grafana import CLI or from the Grafana UI.

Known Issues

Podman is unable to pull from NetApp's container registry cr.netapp.io. Until issue is resolved, Podman users can pull from a separate proxy like this podman pull netappdownloads.jfrog.io/oss-docker-harvest-production/harvest:latest.

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

The Unix collector is unable to monitor pollers running in containers. See #249 for details.

Enhancements

  • πŸŽ‡ Harvest adds an ONTAP event management system (EMS) events collector in this release.
    It collects ONTAP events, exports them to Prometheus, and provides integration with Prometheus AlertManager.
    Full list of 64 events

  • New Harvest Headroom dashboard. #1039 Thanks to @faguayot for reporting.

  • New Quota dashboard. #1111 Thanks to @ev1963 for raising this feature request.

  • We've made lots of improvements to the REST Perf collector and filled several gaps in this release. #881

  • Harvest Power dashboard should include Min Ambient Temp and Min Temp. Thanks to Papadopoulos Anastasios for reporting.

  • Harvest Disk dashboard should include the Back-to-back CP Count and Write Latency metrics. #1040 Thanks to @faguayot for reporting.

  • Rest templates should be disabled by default until ONTAP removes ZAPI support. That way, Harvest does not double collect and store metrics.

  • Harvest dashboards name prefix should be ONTAP: instead of NetApp Detail:. #1080. Thanks to Martin MΓΆbius for reporting.

  • Harvest Qtree dashboard should show Total Qtree IOPs and Internal IOPs panels and Qtree filter. #1079 Thanks to @mamoep for reporting.

  • Harvest Cluster dashboard should show SVM Performance panel. #1117 Thanks to @Falcon667 for reporting.

  • Combine SnapMirror and Data Protection dashboards. #1082. Thanks to Martin MΓΆbius for reporting.

  • vscan performance object should be enabled by default. #1182 Thanks to Gabriel Conne for reporting on Slack.

  • Lun and Volume dashboard should use topk range. #1184 Thanks to Papadopoulos Anastasios for reporting on Slack. These changes make these dashboards more consistent with Harvest 1.6.

  • New MetricAgent plugin. It is used to manipulate metrics based on a set of rules.

  • New Max plugin. It creates a new collection of metrics by calculating max of metric values from an existing matrix for a given label.

  • bin/zapi should support querying multiple performance counters. #1167

  • Harvest REST private CLI should include filter support

  • Harvest should support request/response logging in Rest/RestPerf Collector.

  • Harvest maximum log file size is reduced from 10mb to 5mb. The maximum number of log files are reduced from 10 to 5.

  • Harvest should consolidate log messages and reduce noise.

Fixes

  • Missing Ambient Temperature for AFF900 in Power Dashboard. #1173 Thanks to @iStep2Step for reporting.

  • Flexgroup latency should match the values reported by ONTAP CLI. #1060 Thanks to @josepaulog for reporting.

  • Perf Zapi Volume label should match Zapi Volume label. The label type was changed to style for Perf ZAPI Volume. #1055 Thanks to Papadopoulos Anastasios for reporting.

  • Zapi:SecurityCert should handle certificates per SVM instead of reporting duplicate instance key errors. #1075 Thanks to @mamoep for reporting.

  • Zapi:SecurityAccount should handle per switch SNMP users instead of reporting duplicate instance key errors. #1088 Thanks to @mamoep for reporting.

  • Wrong throughput units in Disk dashboard. #1091 Thanks to @Falcon667 for reporting.

  • Qtree Dashboard shows no data when SVM/Volume are selected from dropdown. #1099 Thanks to Papadopoulos Anastasios for reporting.

  • Virus Scan connections Active panel in SVM dashboard shows decimal places in Y axis. #1101 Thanks to Rene Meier for reporting.

  • Add Disk Utilization per Aggregate description in Disk Dashboard. #1193 Thanks to @faguayot for reporting.

  • Prometheus exporter should escape label_value. #1128 Thanks to @vavdoshka for reporting.

  • Grafana import dashboard fails if anonymous access is enabled. @1132 Thanks @iStep2Step for reporting.

  • Improve color consistency and hover information on Compliance/Data Protection dashboards. #1083 Thanks to Rene Meier for reporting.

  • Compliance & Security Dashboards the text is unreadable with Grafana light theme. #1078 Thanks to @mamoep for reporting.

  • InfluxDB exporter should not require bucket, org, port, or precision fields when using url. #1155 Thanks to li fi for reporting.

  • Node CPU Busy and Disk Utilization should match the same metrics reported by ONTAP sysstat -m CLI. #1152 Thanks to Papadopoulos Anastasios for reporting.

  • Harvest should detect counter overflow and report it as 0. [#762] Thanks to @rodenj1 for reporting.

  • Zerolog console logger fails to log stack traces. #1044

22.05.0

11 May 12:33
2bc2942
Compare
Choose a tag to compare

Releases

22.05.0 / 2022-05-11

πŸš€ Highlights of this major release include:

  • Early access to ONTAP REST perf collector from ONTAP 9.11.1GA+

  • βŒ› New Container Registry - Several of you have mentioned that you are being rate-limited when pulling Harvest Docker images from DockerHub. To alleviate this problem, we're publishing Harvest images to NetApp's container registry (cr.netapp.io). Going forward, we'll publish images to both DockerHub and cr.netapp.io. More information in the FAQ. No action is required unless you want to switch from DockerHub to cr.netapp.io. If so, the FAQ has you covered.

  • Five new dashboards added in this release

    • Power dashboard
    • Compliance dashboard
    • Security dashboard
    • Qtree dashboard
    • NFSv4 Store Pool dashboard (disabled by default)
  • New value_to_num_regex plugin allows you to map all matching expressions to 1 and non-matching ones to 0.

  • Harvest pollers can optionally read credentials from a mounted volume or file. This enables Hashicorp Vault support and works especially well with Vault agent

  • bin/grafana import provides a --multi flag that rewrites dashboards to include multi-select dropdowns for each variable at the top of the dashboard

  • The conf/rest collector templates received a lot of attentions this release. All known gaps between the ZAPI and REST collector have been filled and there is full parity between the two from ONTAP 9.11+. 🀘

  • 24 bug fixes, 48 feature, and 5 documentation commits this release

IMPORTANT ‼️ After upgrade, don't forget to re-import your dashboards so you get all the new enhancements and fixes. You can import via bin/harvest/grafana import cli or from the Grafana UI.

IMPORTANT The conf/zapiperf/cdot/9.8.0/object_store_client_op.yaml ZapiPerf template is being deprecated in this release and will be removed in the next release of Harvest. No dashboards use the counters defined in this template and all counters are being deprecated by ONTAP. If you are using these counters, please create your own copy of the template.

Known Issues

IMPORTANT 7-mode filers that are not on the latest release of ONTAP may experience TLS connection issues with errors like tls: server selected unsupported protocol version 301 This is caused by a change in Go 1.18. The default for TLS client connections was changed to TLS 1.2 in Go 1.18. Please upgrade your 7-mode filers (recommended) or set tls_min_version: tls10 in your harvest.yml poller section. See #1007 for more details.

The Unix collector is unable to monitor pollers running in containers. See #249 for details.

Enhancements

  • Harvest should include a Power dashboard that shows power consumed, temperatures and fan speeds at a node and shelf level #932 and #903

  • Harvest should include a Security dashboard that shows authentication methods and certificate expiration details for clusters, volume encryption and status of anti-ransomware for volumes and SVMs #935

  • Harvest should include a Compliance dashboard that shows compliance status of clusters and SVMs along with individual compliance attributes #935

  • SVM dashboard should show antivirus counters in the CIFS drill-down section #913 Thanks to @burkl for reporting

  • Cluster and Aggregate dashboards should show Storage Efficiency Ratio metrics #888 Thanks to @Falcon667 for reporting

  • 🚧 This is another step in the ZAPI to REST road map. In earlier releases, we focused on config ZAPIs and in this release we've added early access to an ONTAP REST perf collector. 🎊 The REST perf collector and thirty-nine templates included in this release, require ONTAP 9.11.1GA+ 😲 These should be considered early access as we continue to improve them. If you try them out or have any feedback, let us know on Slack or GitHub #881

  • Harvest should collect NFS v4.2 counters which are new in ONTAP 9.11+ releases #572

  • Plugin logging should include object detail #986

  • Harvest dashboards should use Time series panels instead of Graph (old) panels #972. Thanks to @ybizeul for raising

  • New regex based plugin value_to_num_regex helps map labels to numeric values for Grafana dashboards.

  • Harvest status should run on systems without pgrep #937 Thanks to Dan Butler for reporting this on Slack

  • When using a credentials file and the poller is not found, also consult the defaults section of the harvest.yml file #936

  • Harvest should include an NFSv4 StorePool dashboard that shows NFSv4 store pool locks and allocation detail #921 Thanks to Rusty Brown for contributing this dashboard.

  • REST collector should report cpu-busytime for node #918 Thanks to @pilot7777 for reporting this on Slack

  • Harvest should include a Qtree dashboard that shows Qtree NFS/CIFS metrics #812 Thanks to @ev1963 for reporting

  • Harvest should support reading credentials from an external file or mounted volume #905

  • Grafana dashboards should have checkbox to show multiple objects in variable drop-down. See comment for details. #815 #939 Thanks to @manuelbock, @bcase303 for reporting

  • Harvest should include Prometheus port (promport) to metadata metric #878

  • Harvest should use NetApp's container registry for Docker images #874

  • Increase ZAPI client timeout for default and volume object #1005

  • REST collector should support retrieving a subset of objects via template filtering support #950

  • Harvest should support minimum TLS version config #1007 Thanks to @jmg011 for reporting and verifying this

Fixes

  • SVM Latency numbers differ significantly on Harvest 1.6 vs Harvest 2.0 #1003 See discussion as well. Thanks to @jmg011 for reporting

  • Harvest should include regex patterns to ignore transient volumes related to backup #929. Not enabled by default, see conf/zapi/cdot/9.8.0/volume.yaml for details. Thanks to @ybizeul for reporting

  • Exclude OS aggregates from capacity used graph #327 Thanks to @matejzero for raising

  • Few panels need to have instant property in Data protection dashboard #945

  • CPU overload when there are several thousands of quotas #733 Thanks to @Flo-Fly for reporting

  • Include 7-mode CLI role commands for Harvest user #891 Thanks to @ybizeul for reporting and providing the changes!

  • Zapi Collector fails to collect data if number of records on a poller is equal to batch size #870 Thanks to @unbreakabl3 on Slack for reporting

  • Wrong object name used in conf/zapi/cdot/9.8.0/snapshot.yaml #862 Thanks to @pilot7777 for reporting

  • Field access-time returned by snapshot-get-iter should be creation-time #861 Thanks to @pilot7777 for reporting

  • Harvest panics when trying to merge empty template #859 Thanks to @pilot7777 for raising