Applied Ai (Papers, Articles & Videos, in production with results)
Figuring out how to implement your ML project? Learn from How other organizations have done it in past?:
- How problem is framed (e.g., personalization as recsys vs. search vs. sequences)
- What machine learning techniques worked (and sometimes, what didn't)
- Why it works, the science behind it with research, literature, and references
- What real-world results were achieved (so you can better assess ROI)
1.Practices 2. Failures 3. Data Quality 4. Data Engineering 5. Classification 6. Regression 7. Computer Vision 8. Natural Language Processing 9. Sequence Modelling 10. Optimization 11. Validation and A/B Testing
Topic | Paper / Article / Video | Company |
---|---|---|
160k+ High School Students Will Graduate Only If a Model Allows Them to | -- | International Baccalaureate |
When It Comes to Gorillas, Google Photos Remains Blind | -- | Google |
An Algorithm That ‘Predicts’ Criminality Based on a Face Sparks a Furor | -- | Harrisburg University |
Topic | Paper / Article / Video | Company |
---|---|---|
Monitoring Data Quality at Scale with Statistical Modeling | -- | Uber |
An Approach to Data Quality for Netflix Personalization Systems | -- | Netflix |
Automating Large-Scale Data Quality Verification | Paper | Amazon |
Meet Hodor — Gojek’s Upstream Data Quality Tool | -- | Gojek |
Reliable and Scalable Data Ingestion at Airbnb | -- | Airbnb |
Topic | Paper / Article / Video | Company |
---|---|---|
Zipline: Airbnb’s Machine Learning Data Management Platform | -- | Airbnb |
Sputnik: Airbnb’s Apache Spark Framework for Data Engineering | -- | Airbnb |
Introducing Feast: an open source feature store for machine learning | Code | Gojek |
Feast: Bridging ML Models and Data | -- | Gojek |
Amundsen — Lyft’s Data Discovery & Metadata Engine | -- | Lyft |
Open Sourcing Amundsen: A Data Discovery And Metadata Platform | Code | Lyft |
Metacat: Making Big Data Discoverable and Meaningful at Netflix | -- | Netflix |
How We Improved Data Discovery for Data Scientists at Spotify | -- | Spotify |
Topic | Paper / Article / Video | Company |
---|---|---|
Using Machine Learning to Predict Value of Homes On Airbnb | -- | Airbnb |
Using Machine Learning to Predict the Value of Ad Requests | -- | Twitter |
Open-Sourcing Riskquant, a Library for Quantifying Risk | Code | NetFlix |
Topic | Paper / Article / Video | Company |
---|---|---|
How Trip Inferences and Machine Learning Optimize Delivery Times on Uber Eats | -- | Uber |
Next-Generation Optimization for Dasher Dispatch at DoorDash | -- | DoorDash |
Matchmaking in Lyft Line (Part 1) (Part 2) (Part 3) |
-- | Lyft |
The Data and Science behind GrabShare Carpooling | Help me in Updating Paper | Grab |
Topic | Paper / Article / Video | Company |
---|---|---|
The Reusable Holdout: Preserving Validity in Adaptive Data Analysis | Paper | Google |
Detecting Interference: An A/B Test of A/B Tests | -- | LinkedIn |
Building Inclusive Products Through A/B Testing | Paper | LinkedIn |
Experimenting to Solve Cramming | -- | Twitter |
Announcing a New Framework for Designing Optimal Experiments with Pyro | Paper Paper |
Uber |
Enabling 10x More Experiments with Traveloka Experiment Platform | -- | Traveloka |
Large scale experimentation at StitchFix | Paper | Stitch Fix |
Modeling Conversion Rates and Saving Millions Using Kaplan-Meier and Gamma Distributions | Code | Better |