fix(framework-core): given files old content and new content, compute the valid hunks #86

TomKristie · 2020-08-19T16:02:59Z

Given a set of files with old content and new content, and the set of valid hunks from the remote PR, generate the hunk for each file given from the user and filter out invalid hunks and invalid files.

I created sub directories to try to add some logical grouping to the user-input to hunk/patch handling logic. This way I can distinguish between handling the user's input, and handling GitHub PR data.
It is likely that I will need to do a follow-up pr to move the github and regex logic into a submodule.

Overview of the important methods:
the src/index.ts will invoke the comment method in src/github-handler/comment-handler/index.ts.
comment gets the github pull request scopes by getPullRequestScope, and parses the user's raw changes to patches by getSuggestionPatches, and creates a pr with those patches (TODO).

This PR focuses on getSuggestionPatches:

get the hunks and filter invalid hunks by getValidSuggestionHunks. The two main submethods to filter invalid hunks are filterOutOfScopeFiles and filterOutOfScopeHunks
get the patches - TODO

Towards #59

codecov · 2020-08-19T16:04:39Z

Codecov Report

❗ No coverage uploaded for pull request base (comment-pr@29f5136). Click here to learn what that means.
The diff coverage is n/a.

@@              Coverage Diff              @@
##             comment-pr      #86   +/-   ##
=============================================
  Coverage              ?   87.88%           
=============================================
  Files                 ?       20           
  Lines                 ?     1874           
  Branches              ?      126           
=============================================
  Hits                  ?     1647           
  Misses                ?      226           
  Partials              ?        1

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 29f5136...0111d88. Read the comment docs.

src/github-handler/comment-handler/suggestion-patch-handler/fix-suggestion-hunks-handler.ts

chingor13 · 2020-08-19T16:57:11Z

src/types/index.ts

+  readonly oldStart: number;
+  readonly oldEnd: number;
+  readonly newStart: number;
+  readonly newEnd: number;


You might want to include the new content here. It would simplify the algorithm:

GitHub patch -> Hunk[] -> filter based on valid ranges/file size -> convert hunk to comments -> create PR review w/ comments

You could skip having to go back and look up the content.
Maybe this is deferred to a later PR.

I use this object when I am getting the hunks. The hunk output from the diff library doesn't produce raw text. It produces a patch text with "+" and "-" prefixing certain lines and also the comment "no new line and end of file" if there is no newline and the end of the file. Therefore I'd need to alter the patch text at this step. So either way I'd need to do a raw text lookup to avoid finicky string manipulation. Therefore I have 2 choices: get and store raw text before I filter the hunks or after. If I do it after the filtration, I save on memory, which could be valuable if some strings are particularly large.

src/github-handler/comment-handler/suggestion-patch-handler/suggestion-hunk-handler.ts

chingor13 · 2020-08-19T18:41:33Z

src/github-handler/comment-handler/suggestion-patch-handler/suggestion-hunk-handler.ts

+  rawContent: RawContent,
+  fileName: string


Could RawContent also track its fileName? It simplifies this method conceptually that we're converting from one RawContent change to a list of Hunks

the user's input is Map<file name, RawContent> and we'd also like to keep it as a map so that we can look up the ranges of what text to apply afterwards (since the diff library text output is in patch format and modifies the original text). Therefore I could map each raw content to a RawContent + file name object. Yup it's doable, but is that object store format a better decision?

src/github-handler/comment-handler/suggestion-patch-handler/index.ts

src/github-handler/comment-handler/suggestion-patch-handler/fix-suggestion-hunks-handler.ts

chingor13 · 2020-08-20T21:17:21Z

src/github-handler/comment-handler/suggestion-patch-handler/scope-suggestion-hunks-handler.ts

+function getValidSuggestionHunks(
+  scope: Range[],
+  suggestedHunks: Hunk[]
+): {inScopeHunks: Hunk[]; outOfScopeHunks: Hunk[]} {


If both of these are already sorted, we should be able to do this in O(n + m) time rather than O(n log m) or O(m log n).

Depending on the expected sizes of the inputs, we might want to switch.

This is non-blocking for this PR.

This would be really interesting to tackle as there'd probably need to be a lot of testing to see what the average case is for users.
#87

…n pull requests (#105) * feat(patch text to hunk bounds): support regex for patch texts (#83) * fix(patch text to hunk bounds): support regex for patch texts * more comments and more tests * fix(framework-core): core-library get remote patch ranges (#84) * fix(framework-core): given files old content and new content, compute the valid hunks (#86) * fix(framework-core): parse raw changes to ranges * refactor(framework-core): rename modules, functions, & re-org project structure (#89) * fix(framework-core): hunk to patch object (#91) * feat: build failure message from invalid hunks (#90) * test: add failing stub and test for building the failure message * fix: implement message building * fix: use original line numbers in error message * docs: add docstring * docs: add note about empty input returning empty string * feat(framework-core): comment on prs given suggestions (#93) * feat(framework-core): main interface for create review on a pull request (#114) * feat(framework-core): main interface for create review on a pull request * docs: fix typo * nits and typos... * gts lint warning fix * fix(framework-core): combine review comments (#116) * fix(framework-core): collapsing timeline and inline comments into single review * test: fixed imports * added case when there are out of scope suggestions and no valid suggestions * feat(framework-core): return review number and variable renaming (#117) * feat(framework-core): return review number and variable renaming * lint Co-authored-by: Jeff Ching <chingor@google.com> Co-authored-by: Justin Beckwith <justin.beckwith@gmail.com> Co-authored-by: Benjamin E. Coe <bencoe@google.com>

TomKristie added 2 commits August 19, 2020 11:50

fix(framework-core): parse raw changes to ranges

def2503

remove tsconfig change

d2f4b8e

TomKristie requested review from chingor13, bcoe and a team August 19, 2020 16:02

TomKristie requested a review from a team as a code owner August 19, 2020 16:02

bcoe approved these changes Aug 19, 2020

View reviewed changes

chingor13 reviewed Aug 19, 2020

View reviewed changes

src/github-handler/comment-handler/suggestion-patch-handler/index.ts Outdated Show resolved Hide resolved

chingor13 reviewed Aug 19, 2020

View reviewed changes

src/github-handler/comment-handler/suggestion-patch-handler/fix-suggestion-hunks-handler.ts Outdated Show resolved Hide resolved

TomKristie added 2 commits August 20, 2020 16:33

more readable code and also store the invalid files

e6d366c

docs

0111d88

chingor13 approved these changes Aug 20, 2020

View reviewed changes

TomKristie merged commit bce5ef5 into googleapis:comment-pr Aug 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(framework-core): given files old content and new content, compute the valid hunks #86

fix(framework-core): given files old content and new content, compute the valid hunks #86

TomKristie commented Aug 19, 2020 •

edited

codecov bot commented Aug 19, 2020 •

edited

chingor13 Aug 19, 2020

TomKristie Aug 20, 2020 •

edited

chingor13 Aug 19, 2020

TomKristie Aug 20, 2020

chingor13 Aug 20, 2020

TomKristie Aug 20, 2020

fix(framework-core): given files old content and new content, compute the valid hunks #86

fix(framework-core): given files old content and new content, compute the valid hunks #86

Conversation

TomKristie commented Aug 19, 2020 • edited

codecov bot commented Aug 19, 2020 • edited

Codecov Report

chingor13 Aug 19, 2020

Choose a reason for hiding this comment

TomKristie Aug 20, 2020 • edited

Choose a reason for hiding this comment

chingor13 Aug 19, 2020

Choose a reason for hiding this comment

TomKristie Aug 20, 2020

Choose a reason for hiding this comment

chingor13 Aug 20, 2020

Choose a reason for hiding this comment

TomKristie Aug 20, 2020

Choose a reason for hiding this comment

TomKristie commented Aug 19, 2020 •

edited

codecov bot commented Aug 19, 2020 •

edited

TomKristie Aug 20, 2020 •

edited