Fix vendor scanning for windows #684

another-rex · 2023-12-01T03:29:47Z

Fixes #657

Did some minor refactoring to move vendor scanning code into separate file. Also added a test for vendor scanning.

This does not add a unit test for crlf change, as git might change line endings of the fixture files upon checkout. (maybe add a .gitattribute file in the future to lock this down to not change line endings.).

Manually tested it to confirm the change works.

The only "code" (non test, non refactor) change is line 105 in pkg/osvscanner/vendored_libs.go

codecov-commenter · 2023-12-01T03:31:06Z

Codecov Report

Attention: 29 lines in your changes are missing coverage. Please review.

Comparison is base (c75d056) 79.08% compared to head (66b76d0) 80.05%.
Report is 2 commits behind head on main.

Files	Patch %	Lines
pkg/osvscanner/vendored_libs.go	61.84%	21 Missing and 8 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #684      +/-   ##
==========================================
+ Coverage   79.08%   80.05%   +0.96%     
==========================================
  Files          86       87       +1     
  Lines        6121     6127       +6     
==========================================
+ Hits         4841     4905      +64     
+ Misses       1075     1004      -71     
- Partials      205      218      +13

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

oliverchang

Thanks Rex!

oliverchang · 2023-12-01T03:49:30Z

pkg/osvscanner/vendored_libs.go

+			return nil
+		}
+
+		windowsEnding := []byte("\r\n")


I assume the bits here are the only functional change?

In that case personally I'd prefer the refactor in a dedicated PR since it's such a huge change 😅

Oops, missed the message, other than adding the tests and the 2 lines for replacing line endings, no code changes has been made, just copied and pasted into a separate file to make it easier to read.

oliverchang · 2023-12-01T03:50:17Z

pkg/osvscanner/vendored_libs.go

+				return err
+			}
+
+			buf = bytes.ReplaceAll(buf, windowsEnding, unixEnding)


Hmm, are there going to be upstream repos with windows endings as the source of truth?

I wonder if we need to be doing this normalization on our indexing side also?

I haven't thought about this, it could be the case, though I believe the default is LF for git, so this will be pretty rare.

Can add some logging into the indexer to see if this is the case.

fwiw I feel like that would be safest so you don't have to think about non-line endings being replaced - I'm pretty sure (especially with what we're discussed in the past @another-rex) that shouldn't be the case because it should be the same underlying bytes (and I expect that's what Git sees too?), but at the end of the day I personally would sleep better knowing that there was a straightforward path to normalization...

I don't know too much about this area of things, but for indexing could you actually index both with and without the replacement (and even do that regardless of if a repo is using \n or \r\n)? that way the downstream wouldn't ever have to actually care right?

Sort of? It will double the storage in datastore and increase the indexer runtime, for I don't think much benefit.

Are you referring to e.g. UTF-8 strings potentially having the bytes 13 10 appearing later on in a e.g. 3 or 4 byte wide character/rune? (I'm not sure if these character exist, but it is a possibility) This will be pretty rare since C or C++ compilers don't commonly support unicode characters in the source code afaik.

Either way I think that can be avoided by adding the same normalization on the indexer side.

another-rex added 2 commits December 1, 2023 12:00

Fix vendored deps for windows

8180ef4

Add test for scanDirVendoredLib

86c959d

another-rex requested a review from oliverchang December 1, 2023 03:29

Fix lints

b8000a7

oliverchang reviewed Dec 1, 2023

View reviewed changes

another-rex added 2 commits December 8, 2023 15:33

Fix test on windows

33f20cd

Merge branch 'main' into fix-vendor-windows

66b76d0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix vendor scanning for windows #684

Fix vendor scanning for windows #684

another-rex commented Dec 1, 2023

codecov-commenter commented Dec 1, 2023 •

edited

oliverchang left a comment

oliverchang Dec 1, 2023

another-rex Dec 1, 2023

G-Rath Dec 1, 2023

another-rex Dec 4, 2023

oliverchang Dec 1, 2023

another-rex Dec 1, 2023

another-rex Dec 1, 2023

G-Rath Dec 1, 2023

another-rex Dec 1, 2023

Fix vendor scanning for windows #684

Are you sure you want to change the base?

Fix vendor scanning for windows #684

Conversation

another-rex commented Dec 1, 2023

codecov-commenter commented Dec 1, 2023 • edited

Codecov Report

oliverchang left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov-commenter commented Dec 1, 2023 •

edited