Skip to content

Version 0.2.1

Latest
Compare
Choose a tag to compare
@hamelin hamelin released this 26 Feb 15:56
· 2 commits to master since this release
b477e78

What's Changed

  • Added a masking token for pruned vocabulary by @cjweir in #47
  • added token frequency to the kernel functions by @cjweir in #48
  • Skip n-grams and better handling of directional windowing (including support for non-symmetric windows) via "reverse". by @lmcinnes in #50
  • Kernel and Window args by @cjweir in #49
  • Kernel offsets by @cjweir in #51
  • Linear Optimal Transport by @lmcinnes in #52
  • EMTokenCooccurrence update by @cjweir in #53
  • EMTCV skip_grams by @cjweir in #54
  • Fixes for spherical vectors (e.g. word vectors) by @lmcinnes in #55
  • Add simple transformers by @lmcinnes in #56
  • Refactoring by @cjweir in #57
  • ApproxWassersteinVectorizer (LA vectorizer), supervised InformationWeight by @lmcinnes in #58
  • BayesEM and Dynamic COO memory by @cjweir in #59
  • Handle disabling JIT for to_fixed_tuple by @lmcinnes in #60
  • added document_context to TokenCooccurrenceVectorizer() by @jc-healy in #61
  • Change default normalization in TCV by @lmcinnes in #63
  • Document_context EM by @cjweir in #65
  • Added a summarize_embedding function to vectorizers.utils. by @jc-healy in #66
  • Refactored the variable document_context to the more intuitively name… by @jc-healy in #67
  • added a helper function to utils called categorical_columns_to_list. by @jc-healy in #68
  • Add SinkhornVectorizer for larger distributions by @lmcinnes in #69
  • CategoricalColumnTransformer documentation by @jc-healy in #70
  • Add dask for simple parallel computation in the word vectorizer by @lmcinnes in #71
  • Count feature compressor by @lmcinnes in #72
  • Fix doc builds per new docs required packages by @lmcinnes in #73
  • Pandoc now required for doc build by @lmcinnes in #74
  • Make some changes to how info weight works; New README. by @lmcinnes in #75
  • SlidingWindowTransformer for working with time-series like data by @lmcinnes in #76
  • Documentation Skeleton by @lmcinnes in #77
  • Add document_vectorization basic tutorial by @lmcinnes in #78
  • More documentation updates by @lmcinnes in #79
  • Added a sep parameter to utils.summarize_embedding() to allow for dif… by @jc-healy in #80
  • Generator based WassersteinVectorizer for sliding window text embeddings by @lmcinnes in #81
  • Fix up some of the test coverage. by @lmcinnes in #82
  • Refactor transformers by @lmcinnes in #83
  • SequentialDifferenceTransformer, function kernels by @lmcinnes in #85
  • Add named function kernels for sliding windows by @lmcinnes in #86
  • Add Compression Vectorizers by @lmcinnes in #87
  • added EdgeListVectorizer by @jc-healy in #88
  • Added max_unique_tokens by @cjweir in #89
  • Ensure contiguous arrays in optimal transport vectorizers by @lmcinnes in #90
  • Ignore Jupyter artifacts by @hamelin in #92
  • Add unit test for CFC transformer by @hamelin in #91
  • Fix documentation by @cakiki in #95
  • Fixed categorical column transformer example in our documentation by @jc-healy in #98
  • [Typo] Rename enviornment.yml to environment.yml by @cakiki in #100
  • The big cooccurrence refactor by @cjweir in #99
  • TimedTCV bug fix by @cjweir in #102
  • Some documentation cleanup by @lmcinnes in #103
  • [Typo] Update README.rst by @cakiki in #104
  • Added some taxonomy of sequences documentation. by @jc-healy in #105
  • Testing how well an html page integrates with our sphinx documentation. by @jc-healy in #106
  • Revert "Testing how well an html page integrates with our sphinx documentation." by @jc-healy in #107
  • Added SignatureVectorizer (iisignature) by @jh83775 in #108
  • Overloaded addition for NgramVectorizer for easy batch processing by @jc-healy in #110
  • Take iisignature off of the list of direct deps by @hamelin in #112
  • Bump version to 0.2 by @hamelin in #113
  • Update preprocessing.py by @cakiki in #116
  • [MINOR] Update README.rst by @cakiki in #117
  • Minor fixes by @lmcinnes in #120
  • Fixed a memory allocation bug in MultiSetCooccurrenceVectorizer adde… by @jc-healy in #119
  • Add publishing step to devops config by @gclendenning in #121
  • Add additional branch triggers by @gclendenning in #125
  • Ensure all tests pass before release by @gclendenning in #127
  • Refactored WassersteinVectorizer by @jc-healy in #126
  • updated version number to push to pypi by @jc-healy in #129
  • Fix version number by @hamelin in #131

New Contributors

Full Changelog: v0.01...release-0.2.1