Python package to perform mixed-type distance calculations

gibsramen/mtdist

mtdist

mtdist is a Python package for computing mixed-type distance metrics on high-dimensional data. Standard distance metrics (Euclidean, Manhattan, etc.) are usually restricted to numeric data, which makes them difficult to combine with other data types (categorical, ordinal, etc.). Distance metrics built to handle mixed-type data, such as Gower distance [1], are often available only in R, which adds friction for data scientists working in Python.

mtdist aims to bring these valuable distance metrics to Python, allowing researchers to more easily analyze mixed-type data for clustering, visualization, and more.
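To make the idea of a mixed-type distance concrete, here is a minimal sketch of Gower distance [1] between two records in plain Python. It is an illustration only and not mtdist's API: the function name `gower_distance` and its parameters are hypothetical, and the sketch assumes equal feature weights, no missing values, and only numeric and categorical features (ordinal handling is omitted).

```python
def gower_distance(a, b, ranges, categorical):
    """Gower distance between two records of equal length.

    a, b:        sequences of feature values (numbers or labels)
    ranges:      dict mapping each numeric feature index to that feature's
                 range (max - min) over the whole dataset
    categorical: set of indices of the categorical features

    Hypothetical helper for illustration; not part of mtdist.
    """
    total = 0.0
    for k, (x, y) in enumerate(zip(a, b)):
        if k in categorical:
            # Categorical feature: 0 if the labels match, 1 otherwise.
            total += 0.0 if x == y else 1.0
        elif ranges[k]:
            # Numeric feature: absolute difference scaled into [0, 1].
            total += abs(x - y) / ranges[k]
        # A numeric feature with zero range is constant and contributes 0.
    # Equal weights: average the per-feature dissimilarities.
    return total / len(a)


# Three records with one numeric feature (age) and one categorical (colour).
rows = [(20, "red"), (30, "red"), (40, "blue")]
ranges = {0: 40 - 20}   # range of the age column
categorical = {1}       # the colour column is categorical

d01 = gower_distance(rows[0], rows[1], ranges, categorical)  # 0.25
d02 = gower_distance(rows[0], rows[2], ranges, categorical)  # 1.0
d12 = gower_distance(rows[1], rows[2], ranges, categorical)  # 0.75
```

Because every feature contributes a value between 0 and 1, numeric and categorical columns can be averaged on the same scale, which is what a plain Euclidean or Manhattan distance cannot do.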

References

[1] Gower, J. C. "A General Coefficient of Similarity and Some of Its Properties." Biometrics 27, no. 4 (1971): 857-71. doi:10.2307/2528823.
