New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes of joacmue #14
Open
ulfgebhardt
wants to merge
139
commits into
bundestag:master
Choose a base branch
from
joacmue:master
base: master
Could not load branches
Branch not found: {{ refName }}
Could not load tags
Nothing to show
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 18 commits
Commits
Show all changes
139 commits
Select commit
Hold shift + click to select a range
4177126
python3-ified some scripts
e7ad0c7
minor clean-up
ba7cc1c
made the --name only variant work
b1b935a
corrected fault with heading indentation
9487f75
Made tables work... sort of.
662fa16
made BR (line break?) work
fee8cde
fixed multi-row headers and lists
d13c8c8
skipping line breaks in "kommentar"
a8aaf26
Made lists render inside table (no indentation)
e761381
prettyfying alphanumeric list indices
69f3a71
clean-up of todos, tables should work now
a4609e0
python3-ed the print statements in lawgit.py, made .jsons better read…
5af2990
prettifying .json outputs (indents, utf-8 umlauts)
joacmue d1e84e8
made the banz scraper work again
joacmue 4dca275
Added some notes on what this actually does
joacmue 5d64c00
Merge remote-tracking branch 'gesetze-tools-upstream/master'
joacmue 9c1aa53
Running lawdown in python3 helps -.-
joacmue f7ebdf1
Should have re-added most of the f' strings instead of the u' ones
joacmue f91a8cf
Some suggested changes from the PR
joacmue 5bdfd5e
Suggested Changes from the PR
joacmue 4da5c5c
re-adding bgbl folder to .gitignore
joacmue 70f8035
python3-ified some scripts
6dc6e7a
minor clean-up
7dd3f8c
made the --name only variant work
ab9b64f
Made tables work... sort of.
4005d7e
made BR (line break?) work
52b80c8
fixed multi-row headers and lists
b324727
skipping line breaks in "kommentar"
8e49ec8
Made lists render inside table (no indentation)
f64454f
prettyfying alphanumeric list indices
f9cb296
clean-up of todos, tables should work now
b25e56a
python3-ed the print statements in lawgit.py, made .jsons better read…
937b86f
prettifying .json outputs (indents, utf-8 umlauts)
joacmue b29530a
made the banz scraper work again
joacmue d3d84ba
Added some notes on what this actually does
joacmue 2bd7a75
Running lawdown in python3 helps -.-
joacmue ee9bf6b
Should have re-added most of the f' strings instead of the u' ones
joacmue eb3ac97
Some suggested changes from the PR
joacmue f36e726
Suggested Changes from the PR
joacmue 09c5004
re-adding bgbl folder to .gitignore
joacmue 35452ff
Merge branch 'master' of https://github.com/joacmue/gesetze-tools
joacmue 17b5ac5
Corrected a copy typo of double brackets
joacmue d4ced4d
Removing two causes of linter errors
joacmue aac5558
removing banz_scraper python 2.x leftovers
joacmue f70a0ed
Removing some linter warnings
joacmue 53b96fc
Minor clean-up
joacmue fb39d0a
minor clean-up
joacmue 92b2bf7
Continuing to please the linter
joacmue 823d076
Minor modifications.
darkdragon-001 2209244
Update data in separate commits/branches.
darkdragon-001 885061a
Some fixes
darkdragon-001 31f948a
Merge remote-tracking branch 'origin/master' into joacmue
darkdragon-001 246cc82
Removing regex qualifiers from non-regex strings
joacmue c41acef
Merge branch 'master' of https://github.com/joacmue/gesetze-tools
joacmue b065734
Re-adding the default flush when outside tables
joacmue 49ef8a6
Removing special handling of lettered list indices
joacmue 532da90
Cleaning up the backspaces in tables & lists
joacmue 3a01c61
not printing leading line break on table headers
joacmue d48adf3
Removing mess around handling breaks
joacmue 2264d43
Cleaning up custombreaks
joacmue 279e19e
Adding empty cells for colspans
joacmue 73aaf7b
Something was strange with the round function
joacmue a7b7e82
Making breaks on encounters of <BR> again
joacmue e4cf4bb
Removing special case for begin of <br>
joacmue 2d13b97
Explicitly parsing colnames for colspans now
joacmue 17bf66b
Making lawdown go over all laws without errors
joacmue fab87e6
Making multiline headers with colspan render nicer
joacmue b117e25
Cleaning up column list handling
joacmue 358fd09
python3-ified some scripts
ac34cd9
minor clean-up
b1152ed
made the --name only variant work
ed7c5a0
Made tables work... sort of.
84c986b
made BR (line break?) work
923c6ec
fixed multi-row headers and lists
389e419
skipping line breaks in "kommentar"
abaefbf
Made lists render inside table (no indentation)
2264272
prettyfying alphanumeric list indices
c5b2048
clean-up of todos, tables should work now
1e21aaf
python3-ed the print statements in lawgit.py, made .jsons better read…
bd9763a
prettifying .json outputs (indents, utf-8 umlauts)
joacmue fe61b75
made the banz scraper work again
joacmue 59eea7e
Added some notes on what this actually does
joacmue 7128419
Running lawdown in python3 helps -.-
joacmue 33e3ee6
Should have re-added most of the f' strings instead of the u' ones
joacmue 5058f20
Some suggested changes from the PR
joacmue b0fa94d
Suggested Changes from the PR
joacmue 7361631
re-adding bgbl folder to .gitignore
joacmue 6ec2842
python3-ified some scripts
a93f191
minor clean-up
4042407
made the --name only variant work
0c916fd
corrected fault with heading indentation
0cd86c3
Made tables work... sort of.
6319695
made BR (line break?) work
c3262fa
fixed multi-row headers and lists
6ce7e2e
skipping line breaks in "kommentar"
ff0ec27
Made lists render inside table (no indentation)
c408faf
prettyfying alphanumeric list indices
7e042b1
clean-up of todos, tables should work now
355d84b
python3-ed the print statements in lawgit.py, made .jsons better read…
b5ef45c
prettifying .json outputs (indents, utf-8 umlauts)
joacmue 699ab06
made the banz scraper work again
joacmue dce5570
Running lawdown in python3 helps -.-
joacmue 4ec214e
Should have re-added most of the f' strings instead of the u' ones
joacmue 0836f20
Some suggested changes from the PR
joacmue a6ad1e6
Suggested Changes from the PR
joacmue 3716a55
re-adding bgbl folder to .gitignore
joacmue fe153d3
Corrected a copy typo of double brackets
joacmue 1689312
Removing two causes of linter errors
joacmue aa9ad98
removing banz_scraper python 2.x leftovers
joacmue 52e0641
Removing some linter warnings
joacmue 170c0ff
Minor clean-up
joacmue f21b36f
minor clean-up
joacmue 6227fcb
Continuing to please the linter
joacmue 2d34d37
Removing regex qualifiers from non-regex strings
joacmue 91bf7ea
Minor modifications.
darkdragon-001 f737049
Update data in separate commits/branches.
darkdragon-001 f1ec414
Some fixes
darkdragon-001 8e22485
Improve issue templates.
darkdragon-001 d2fcb9d
Try to fix formatting template.
darkdragon-001 f820c53
Enable CI also for PRs.
darkdragon-001 f8026bf
Re-adding the default flush when outside tables
joacmue 06fe657
Removing special handling of lettered list indices
joacmue 5dae428
Cleaning up the backspaces in tables & lists
joacmue 4905e9a
not printing leading line break on table headers
joacmue 1d2026a
Removing mess around handling breaks
joacmue 45bb01b
Cleaning up custombreaks
joacmue 99dc3c0
Adding empty cells for colspans
joacmue 5f41808
Something was strange with the round function
joacmue 2f61890
Making breaks on encounters of <BR> again
joacmue 625bd5e
Removing special case for begin of <br>
joacmue 83bcc5c
Explicitly parsing colnames for colspans now
joacmue 724c33b
Making lawdown go over all laws without errors
joacmue 074e674
Making multiline headers with colspan render nicer
joacmue bd96212
Cleaning up column list handling
joacmue 0707c01
Rebased to master
joacmue 10cff58
Re-rean bgbl_scraper, updated readme.md
joacmue c66ed0c
Merge branch 'master' of https://github.com/joacmue/gesetze-tools
joacmue 978b871
aligned vkbl.json formatting with other files
joacmue 4c76092
Minor fixes
darkdragon-001 File filter
Filter by extension
Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,7 @@ | ||
test.zip | ||
test.py | ||
bgbl | ||
laws | ||
laws-md | ||
.vscode | ||
__pycache__ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change | ||
---|---|---|---|---|
|
@@ -4,7 +4,6 @@ BundesGit Gesetze Tools | |||
These scripts are used to keep the law repository up to date. | ||||
|
||||
Install requirements: | ||||
|
||||
```bash | ||||
pip install -r requirements.txt | ||||
``` | ||||
|
@@ -17,28 +16,57 @@ Downloads all laws as XML files from | |||
[www.gesetze-im-internet.de](http://www.gesetze-im-internet.de/) | ||||
and extracts them to a directory. | ||||
|
||||
Last tested: 2017-01-14 SUCCESS | ||||
### Useage | ||||
darkdragon-001 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||
Update your list of laws first: | ||||
```bash | ||||
python lawde.py updatelist | ||||
python lawde.py loadall | ||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
This is still stated below and after the note. |
||||
``` | ||||
|
||||
You can then download all laws by calling (<span style="color:red">**not recommended!**</span>) | ||||
```bash | ||||
python lawde.py loadall | ||||
``` | ||||
Which will take approx. 2-3hrs. | ||||
|
||||
Alternatively, you can find the individual law you're interested in in [./data/laws.json](./data/laws.json), which is mostly a list of laws in this form: | ||||
```bash | ||||
{"slug": "<shortname>", "name": "<longname>", "abbreviation": "<abbreviation>"} | ||||
``` | ||||
You can download individual laws by calling (<span style="color:red">**recommended**</span>) | ||||
```bash | ||||
python lawde.py load <shortname> | ||||
``` | ||||
|
||||
Last tested: 2020-12-05 SUCCESS | ||||
|
||||
## lawdown.py | ||||
|
||||
Converts all XML laws to Markdown and copies them with other files related | ||||
to the law into specified working directory. | ||||
|
||||
Last tested: 2017-01-14 SUCCESS | ||||
### Useage | ||||
darkdragon-001 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||
```bash | ||||
python lawdown.py convert <inpath> <outpath> | ||||
python lawdown.py convert ./laws ./laws-md | ||||
``` | ||||
|
||||
Last tested: 2020-12-05 SUCCESS | ||||
|
||||
## bgbl_scraper.py | ||||
|
||||
Scrapes the table of contents of all issues of the Bundesgesetzblatt and dumps | ||||
the result to JSON. | ||||
|
||||
Last tested: 2017-01-14 FAILED ("KeyError: xaversid") | ||||
Last tested: 2020-12-05 FAILED ("KeyError: xaversid") | ||||
Issue seems to be a restructure of the Bundesanzeiger Webpage. The original bgbl links get an error 404. | ||||
|
||||
## banz_scraper.py | ||||
|
||||
Scrapes the table of contents of all available issues of the Bundesanzeiger and | ||||
dumps the result to JSON. | ||||
|
||||
Last tested: 2017-01-14 SUCCESS | ||||
Last tested: 2020-12-23 SUCCESS | ||||
|
||||
## vkbl_scraper.py | ||||
|
||||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Who creates these files?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That would be me. Did not want to push my notes and test scripts to the repo. Should probably store those outside the working folder and remove it from .gitignore