WIP: Adding support for reading vaspwave.h5 in Wavecar and Chgcar class #3768

Shibu778 · 2024-04-18T10:50:11Z

Dear Maintainers,
This is my first contribution to pymatgen. I am relatively new to opensource and would appreciate your feedback and guidance in improving the added features and this PR. In further commits, I will add read_from_hdf5 functionality to Chgcar and write all the tests.

Summary

Major changes:

feature 1: Added a method read_from_hdf5 in Wavecar class to read WAVECAR related information from vaspwave.h5 file (refer to vaspwiki).
fix 1: made a function to read the legacy WAVECAR format that was previously implemented in Wavecar class.
fix 2: added appropriate comments and doc-string
fix 3: Some part of the pymatgen/io/vasp/outputs.py file got formatted by black formatter

Todos

This is work in progress, following needs to be done

feature 2: Add the same feature in Chgcar class
fix 4: Write appropriate test to read the hdf5 files in using Wavecar and Chgcar class

Pre-commit result

pre-commit run --all-files passed all everything

~\Github\pymatgen> pre-commit run --all-files
ruff.....................................................................Passed
ruff-format..............................................................Passed
check yaml...............................................................Passed
fix end of files.........................................................Passed
trim trailing whitespace.................................................Passed
mypy.....................................................................Passed
codespell................................................................Passed
cython-lint..............................................................Passed
double-quote Cython strings..............................................Passed
blacken-docs.............................................................Passed
markdownlint.............................................................Passed
nbstripout...............................................................Passed

Checklist

Google format doc strings added. Check with ruff.
Type annotations included. Check with mypy.
The tests for fixes in Wavecar class remains unchanged to read legacy WAVECAR file
Tests added for new features.

Summary by CodeRabbit

Refactor
- Improved code readability and formatting in various functions.
- Updated function signatures for clarity.
Documentation
- Enhanced function descriptions with new comments and docstrings.

coderabbitai · 2024-04-18T10:50:23Z

Walkthrough

The recent changes across the pymatgen/io/vasp/outputs.py and tests/io/vasp/test_outputs.py files focus on improving code readability and maintainability. These modifications include importing h5py, enhancing tuple unpacking, refining code formatting, updating function signatures with type annotations, and enriching documentation through comments and docstrings.

Changes

File Path	Changes Summary
`pymatgen/io/vasp/outputs.py`	Added `h5py` import, refactored tuple unpacking, improved formatting, updated function signatures, and enriched documentation.
`tests/io/vasp/test_outputs.py`	Restructured code for readability and maintainability, improved code organization without altering core functionality.

🐰✨
A hop of joy for cleaner code,
With tuples unpacked along the road.
Type hints and docs, now clear as day,
In outputs.py, improvements stay.
Cheers to the devs, with ears so long,
For making the code both swift and strong.
🌟📄

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai generate interesting stats about this repository and render them as a table.
- @coderabbitai show all the console.log statements in this repository.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (invoked as PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger a review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai help to get help.

Additionally, you can add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.

CodeRabbit Configration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 1

coderabbitai

Actionable comments posted: 4

coderabbitai · 2024-04-18T17:33:48Z

pymatgen/io/vasp/outputs.py

+                                (
+                                    self.kpoints,
+                                    self.actual_kpoints,
+                                    self.actual_kpoints_weights,
+                                ) = self._parse_kpoints(elem)


Ensure proper error handling in the __init__ method for file operations.

Consider adding a try-except block around the file opening and parsing logic to handle potential I/O errors or XML parsing issues gracefully.

coderabbitai · 2024-04-18T17:33:48Z

pymatgen/io/vasp/outputs.py

+                [
+                    max(cbm_spins[0] - vbm_spins[0], 0),
+                    max(cbm_spins[1] - vbm_spins[1], 0),
+                ],
                [cbm_spins[0], cbm_spins[1]],
                [vbm_spins[0], vbm_spins[1]],
-                [vbm_spins_kpoints[0] == cbm_spins_kpoints[0], vbm_spins_kpoints[1] == cbm_spins_kpoints[1]],
+                [
+                    vbm_spins_kpoints[0] == cbm_spins_kpoints[0],
+                    vbm_spins_kpoints[1] == cbm_spins_kpoints[1],
+                ],


Refine the method for extracting eigenvalue band properties.

- def eigenvalue_band_properties(self): + def get_eigenvalue_band_properties(self):

Rename the method to follow Python's naming conventions for methods that return data.

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Suggested change

[

max(cbm_spins[0] - vbm_spins[0], 0),

max(cbm_spins[1] - vbm_spins[1], 0),

],

[cbm_spins[0], cbm_spins[1]],

[vbm_spins[0], vbm_spins[1]],

[vbm_spins_kpoints[0] == cbm_spins_kpoints[0], vbm_spins_kpoints[1] == cbm_spins_kpoints[1]],

[

vbm_spins_kpoints[0] == cbm_spins_kpoints[0],

vbm_spins_kpoints[1] == cbm_spins_kpoints[1],

],

[

max(cbm_spins[0] - vbm_spins[0], 0),

max(cbm_spins[1] - vbm_spins[1], 0),

],

[cbm_spins[0], cbm_spins[1]],

[vbm_spins[0], vbm_spins[1]],

[

vbm_spins_kpoints[0] == cbm_spins_kpoints[0],

vbm_spins_kpoints[1] == cbm_spins_kpoints[1],

],

Ensure the data structure for WSWQ is correctly initialized.

Verify that the initialization of the data arrays in the WSWQ class handles all possible configurations correctly, especially with different spin and k-point counts.

Optimize the file writing process in the write_file method.

- def write_file(self, filename): + def save_to_file(self, filename):

Rename the method to better reflect its functionality and align with common Python practices.

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.

Suggested change

[

max(cbm_spins[0] - vbm_spins[0], 0),

max(cbm_spins[1] - vbm_spins[1], 0),

],

[cbm_spins[0], cbm_spins[1]],

[vbm_spins[0], vbm_spins[1]],

[vbm_spins_kpoints[0] == cbm_spins_kpoints[0], vbm_spins_kpoints[1] == cbm_spins_kpoints[1]],

[

vbm_spins_kpoints[0] == cbm_spins_kpoints[0],

vbm_spins_kpoints[1] == cbm_spins_kpoints[1],

],

[

max(cbm_spins[0] - vbm_spins[0], 0),

max(cbm_spins[1] - vbm_spins[1], 0),

],

[cbm_spins[0], cbm_spins[1]],

[vbm_spins[0], vbm_spins[1]],

[

vbm_spins_kpoints[0] == cbm_spins_kpoints[0],

vbm_spins_kpoints[1] == cbm_spins_kpoints[1],

],

Shibu778 · 2024-04-19T04:03:18Z

tests/io/vasp/test_outputs.py

@@ -1736,6 +1819,32 @@ def test_standard(self):
        finally:
            sys.stdout = saved_stdout

+    def test_vaspwave(self):


This functions checks whether the information read from legacy and hdf5 files are same or not.

Shibu778 · 2024-04-19T04:05:03Z

tests/io/vasp/test_outputs.py

+            for b in range(vaspwave.nb):
+                assert len(vaspwave.coeffs[k][b]) == len(wavecar.Gpoints[k])
+
+        with pytest.raises(ValueError, match="invalid filetype='json'"):


Proper filetype should be given.

Shibu778 · 2024-04-19T04:08:12Z

tests/io/vasp/test_outputs.py

+        with pytest.raises(ValueError, match="invalid filetype='json'"):
+            Wavecar(f"{VASP_OUT_DIR}/vaspwave.fccSi.h5.gz", filetype="json", vasp_type="std")
+
+        with pytest.raises(ValueError, match="Invalid rtag=.+, must be one of"):


Default filetype is legacy. It will result in Value error for in valid rtag while reading hdf5 file as it tries to read the hdf5 file as legacy WAVECAR file. So, hdf5 filetype should be provided while reading hdf5 file.

Added support for vaspwave.h5

bfb1c25

Shibu778 requested review from shyuep, janosh and mkhorton as code owners April 18, 2024 10:50

coderabbitai bot reviewed Apr 18, 2024

View reviewed changes

Shibu778 added 3 commits April 18, 2024 20:57

Bug fix in read_from_hdf5

239bafe

added tests for vaspwave.h5 reading

cdb32a1

Merge branch 'materialsproject:master' into master

2c6432f

coderabbitai bot reviewed Apr 18, 2024

View reviewed changes

Shibu778 commented Apr 19, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Adding support for reading vaspwave.h5 in Wavecar and Chgcar class #3768

WIP: Adding support for reading vaspwave.h5 in Wavecar and Chgcar class #3768

Shibu778 commented Apr 18, 2024 •

edited

coderabbitai bot commented Apr 18, 2024 •

edited

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configration File (`.coderabbit.yaml`)

Documentation and Community

coderabbitai bot left a comment

coderabbitai bot left a comment

coderabbitai bot Apr 18, 2024

coderabbitai bot Apr 18, 2024

Shibu778 Apr 19, 2024

Shibu778 Apr 19, 2024

Shibu778 Apr 19, 2024

WIP: Adding support for reading vaspwave.h5 in Wavecar and Chgcar class #3768

Are you sure you want to change the base?

WIP: Adding support for reading vaspwave.h5 in Wavecar and Chgcar class #3768

Conversation

Shibu778 commented Apr 18, 2024 • edited

Summary

Todos

Pre-commit result

Checklist

Summary by CodeRabbit

coderabbitai bot commented Apr 18, 2024 • edited

Walkthrough

Changes

Chat

CodeRabbit Commands (invoked as PR comments)

CodeRabbit Configration File (.coderabbit.yaml)

Documentation and Community

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Apr 18, 2024

Choose a reason for hiding this comment

coderabbitai bot Apr 18, 2024

Choose a reason for hiding this comment

Shibu778 Apr 19, 2024

Choose a reason for hiding this comment

Shibu778 Apr 19, 2024

Choose a reason for hiding this comment

Shibu778 Apr 19, 2024

Choose a reason for hiding this comment

Shibu778 commented Apr 18, 2024 •

edited

coderabbitai bot commented Apr 18, 2024 •

edited

CodeRabbit Configration File (`.coderabbit.yaml`)