Traversing unrelated histories? #212

GunArm · 2021-08-05T17:17:47Z

Looking for a little info on how repostat traverses the tree and/or how it would react to unrelated/unconnected histories within a repo.

I'm trying to get cumulative statistics on a series of repos shared within a team. I made a dummy repo, added all the other repos as remotes, and fetched all their content, so I have one repo with multiple unrelated histories. I tried to run repostat on it, but it seems to only traverse the tree that the checked-out branch is on. Is my inference about what is happening correct? Is there any way (a flag, etc.) to force it to traverse disconnected histories/trees?

When I get some time I will try to make a junk merge between some random branches on each of the separate trees, just to connect them and see if it works that way.

pulkomandy · 2021-08-05T18:14:35Z

Currently repostat is designed to work with a mostly linear history. It will not work even with a big merge commit, because only one of the parents of a merge commit is explored:

repostat/analysis/gitdata.py, line 70 in 43fc4e6:

elif len(commit.parents) == 1:

I think the best way to do what you want is to create one instance of GitHistory per repo to analyze (https://github.com/vifactor/repostat/blob/master/analysis/gitrepository.py#L16) and then aggregate the stats from all of them to produce a single report. That's not possible currently, though; some code changes would be needed.
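
To illustrate the aggregation step only, here is a minimal sketch that assumes each repo's statistics have already been reduced to per-author commit counts. The function name `aggregate_author_commits` and the sample data are hypothetical; nothing here is repostat's actual API.

```python
from collections import Counter


def aggregate_author_commits(per_repo_counts):
    """Sum per-author commit counts across several independent repositories.

    per_repo_counts maps a repo name to a {author: commit_count} mapping.
    """
    total = Counter()
    for counts in per_repo_counts.values():
        # Counter.update adds counts instead of replacing them.
        total.update(counts)
    return total


# Hypothetical per-repo results, standing in for whatever each
# per-repo analysis would produce.
per_repo = {
    "repo_a": {"alice": 10, "bob": 3},
    "repo_b": {"alice": 2, "carol": 7},
}

combined = aggregate_author_commits(per_repo)
for author, commits in combined.most_common():
    print(author, commits)
```

Summing like this is only valid when the repos share no commits, which is the condition discussed further down.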

Or maybe it's possible to do it inside the GitRepository class by modifying the creation of self.whole_history_df, self.linear_history_df, self._head_revision (not sure what you'd put there if there are multiple heads), and self._tags (probably doesn't make a lot of sense in your case either).

I'm not sure how easy it is to generate whole history and linear history for multiple starting points.

One condition for this to work is that the different heads don't end up merging into some common history; otherwise, the commits before that splitting point would be counted twice. But in your case of independent repositories, this should be fine.
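
If the heads could share history, one way to avoid double counting is to walk from all heads at once and remember which commits were already visited. The sketch below does that on a toy commit graph (a plain dict mapping each commit to its parents). The function name and the graph are made up for illustration; this is not how repostat currently walks history.

```python
def reachable_commits(parents, heads):
    """Return every commit reachable from any head, visiting each one once.

    parents maps a commit id to the list of its parent ids. All parents of
    a merge commit are followed, not just the first.
    """
    seen = set()
    stack = list(heads)
    while stack:
        commit = stack.pop()
        if commit in seen:
            # Already counted via another head or another merge parent.
            continue
        seen.add(commit)
        stack.extend(parents.get(commit, ()))
    return seen


# Toy graph: histories "a" and "b" are unrelated to each other,
# while "m1" merges a side branch ("c1") that forked from "a1".
parents = {
    "a3": ["a2"], "a2": ["a1"], "a1": [],
    "b2": ["b1"], "b1": [],
    "m1": ["a2", "c1"], "c1": ["a1"],
}

commits = reachable_commits(parents, ["a3", "b2", "m1"])
print(len(commits))
```

Here `a1` and `a2` are reachable from both `a3` and `m1`, but the `seen` set ensures each contributes only once to the total.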
