Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

String compression #556

Open
JohannesLichtenberger opened this issue Nov 14, 2022 · 13 comments
Open

String compression #556

JohannesLichtenberger opened this issue Nov 14, 2022 · 13 comments

Comments

@JohannesLichtenberger
Copy link
Member

FSST: Fast Random Access String Compression

@JohannesLichtenberger
Copy link
Member Author

Useable either as a replacement of the current dictionary encoding (used to store object field names or element names/attribute names) or simply for the text compression of string values.

@AlvinKuruvilla
Copy link
Contributor

AlvinKuruvilla commented Nov 16, 2022

I feel the best way to implement this is to keep this as a separate library so we can maintain it separately and add it as a dependency. I also found the original c++ code used in the paper

@AlvinKuruvilla
Copy link
Contributor

This is the current implementation I have... if we want, we can migrate this to the Sirix organization and continue development from

@JohannesLichtenberger
Copy link
Member Author

I think you could simply develop it as a separate library and I can add it as a dependency once it's finished and published to maven central :-)

@AlvinKuruvilla
Copy link
Contributor

I think you could simply develop it as a separate library and I can add it as a dependency once it's finished and published to maven central :-)

Good point

@JohannesLichtenberger
Copy link
Member Author

@AlvinKuruvilla did you made any advances?

@JohannesLichtenberger
Copy link
Member Author

@AlvinKuruvilla ping :-)

@AlvinKuruvilla
Copy link
Contributor

Sorry @JohannesLichtenberger, not lately. I just finished school. I hope to get some more work done now that I have some free time. I have a feeling this is going to be a longer-term issue, especially with all of the tests that need to be ported over

@Aminmalek
Copy link
Contributor

is this issue fixed?
@JohannesLichtenberger

@JohannesLichtenberger
Copy link
Member Author

No, still open. But I think low priority

@AlvinKuruvilla
Copy link
Contributor

Yeah, sorry about that @JohannesLichtenberger , I didn't mean to ghost you like that. I've been busy with school and those projects. I'm still interested in the project just haven't had the time as of late., I have most of the basic building blocks coded up for the most part, and I had some tests written up. Can we consider making this part of the org? It's nowhere near ready or stable, but I'm familiar enough with the codebase to set things up so people can at least look at open issues and consider contributing. That way, we can drive some progress when I can't work on it.

@JohannesLichtenberger
Copy link
Member Author

@Aminmalek wanted to work on this, so you may transfer it to the organization.

@Aminmalek
Copy link
Contributor

@AlvinKuruvilla we can work on this together.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants