Skip to content

MvMukesh/Applied-Ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 

Repository files navigation

Applied-Ai

Applied Ai (Papers, Articles & Videos, in production with results)


Figuring out how to implement your ML project? Learn from How other organizations have done it in past?:

  • How problem is framed (e.g., personalization as recsys vs. search vs. sequences)
  • What machine learning techniques worked (and sometimes, what didn't)
  • Why it works, the science behind it with research, literature, and references
  • What real-world results were achieved (so you can better assess ROI)

Content Table

1.Practices 2. Failures 3. Data Quality 4. Data Engineering 5. Classification 6. Regression 7. Computer Vision 8. Natural Language Processing 9. Sequence Modelling 10. Optimization 11. Validation and A/B Testing


Practices

Topic Paper / Article / Video Company
Practical Recommendations for Gradient-Based Training of Deep Architectures Paper Yoshua Bengio
Machine Learning: The High Interest Credit Card of Technical Debt Paper
Paper
Google
Rules of Machine Learning: Best Practices for ML Engineering -- Google
On Challenges in Machine Learning Model Management -- Amazon
Machine Learning in production: the Booking.com approach -- Booking
150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com Paper Booking
Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department -- Stitch Fix
Beware the Data Science Pin Factory: The Power of the Full-Stack Data Science Generalist -- Stitch Fix

Failures

Topic Paper / Article / Video Company
160k+ High School Students Will Graduate Only If a Model Allows Them to -- International Baccalaureate
When It Comes to Gorillas, Google Photos Remains Blind -- Google
An Algorithm That ‘Predicts’ Criminality Based on a Face Sparks a Furor -- Harrisburg University

Data Quality

Topic Paper / Article / Video Company
Monitoring Data Quality at Scale with Statistical Modeling -- Uber
An Approach to Data Quality for Netflix Personalization Systems -- Netflix
Automating Large-Scale Data Quality Verification Paper Amazon
Meet Hodor — Gojek’s Upstream Data Quality Tool -- Gojek
Reliable and Scalable Data Ingestion at Airbnb -- Airbnb

Data Engineering

Topic Paper / Article / Video Company
Zipline: Airbnb’s Machine Learning Data Management Platform -- Airbnb
Sputnik: Airbnb’s Apache Spark Framework for Data Engineering -- Airbnb
Introducing Feast: an open source feature store for machine learning Code Gojek
Feast: Bridging ML Models and Data -- Gojek
Amundsen — Lyft’s Data Discovery & Metadata Engine -- Lyft
Open Sourcing Amundsen: A Data Discovery And Metadata Platform Code Lyft
Metacat: Making Big Data Discoverable and Meaningful at Netflix -- Netflix
How We Improved Data Discovery for Data Scientists at Spotify -- Spotify

Classification

Topic Paper / Article / Video Company
High-Precision Phrase-Based Document Classification on a Modern Scale Paper LinkedIn
Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing Paper WalmartLabs
Large-scale Item Categorization for e-Commerce Paper DianPing
eBay
Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks Paper NAVER
Categorizing Products at Scale -- Shopify
Learning to Diagnose with LSTM Recurrent Neural Networks Paper Google
Discovering and Classifying In-app Message Intent at Airbnb -- Airbnb
How We Built the Good First Issues Feature -- GitHub
Teaching Machines to Triage Firefox Bugs -- Mozilla
Testing Firefox More Efficiently with Machine Learning -- Mozilla
Using ML to Subtype Patients Receiving Digital Mental Health Interventions Paper Microsoft
Prediction of Advertiser Churn for Google AdWords Paper Google

Regression

Topic Paper / Article / Video Company
Using Machine Learning to Predict Value of Homes On Airbnb -- Airbnb
Using Machine Learning to Predict the Value of Ad Requests -- Twitter
Open-Sourcing Riskquant, a Library for Quantifying Risk Code NetFlix

Computer Vision

Topic Paper / Article / Video Company
Categorizing Listing Photos at Airbnb -- Airbnb
Amenity Detection and Beyond — New Frontiers of Computer Vision at Airbnb -- Airbnb
Powered by AI: Advancing product understanding and building new shopping experiences -- Facebook
Creating a Modern OCR Pipeline Using Computer Vision and Deep Learning -- Dropbox
How we Improved Computer Vision Metrics by More Than 5% Only by Cleaning Labelling Errors -- Deepomatic
A Neural Weather Model for Eight-Hour Precipitation Forecasting Paper Google
Machine Learning-based Damage Assessment for Disaster Relief Paper Google
RepNet: Counting Repetitions in Videos Paper Google
Converting Text to Images for Product Discovery Paper Amazon
How Disney uses PyTorch for Animated Character Recognition -- Disney
Image Captioning as an Assistive Technology Video IBM

Natural Language Processing

Topic Paper / Article / Video Company
Abusive Language Detection in Online User Content Paper Yahoo
How Natural Language Processing Helps LinkedIn Members Get Support Easily -- LinkedIn
Building Smart Replies for Member Messages -- LinkedIn
Smart Reply: Automated Response Suggestion for Email Paper Google
SmartReply for YouTube Creators -- Google
Using Neural Networks to Find Answers in Tables Paper Google
A Scalable Approach to Reducing Gender Bias in Google Translate -- Google
Assistive AI Makes Replying Easier -- Microsoft
AI Advances to Better Detect Hate Speech -- Facebook
A State-of-the-Art Open Source Chatbot Paper Facebook
A Highly Efficient, Real-Time Text-to-Speech System Deployed on CPUs -- Facebook
Goal-Oriented End-to-End Conversational Models with Profile Features in a Real-World Setting Paper Amazon
How Gojek Uses NLP to Name Pickup Locations at Scale -- GoJek
Give Me Jeans not Shoes: How BERT Helps Us Deliver What Clients Want -- Stitch Fix
The State-of-the-art Open-Domain Chatbot in Chinese and English Paper Baidu
Deep Learning to Translate Between Programming Languages Paper Facebook
PEGASUS: A State-of-the-Art Model for Abstractive Text Summarization Paper) (Code Google

Sequence Modelling

Topic Paper / Article / Video Company
Recommending Complementary Products in E-Commerce Push Notifications with Mixture Models Paper Alibaba
Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction Paper Alibaba
Search-based User Interest Modeling with Lifelong Sequential Behavior Data for CTR Prediction Paper Alibaba
Learning to Diagnose with LSTM Recurrent Neural Networks Paper Google
Deep Learning for Understanding Consumer Histories Paper Zalando
Continual Prediction of Notification Attendance with Classical and Deep Network Approaches Paper Telefonica
Using Recurrent Neural Network Models for Early Detection of Heart Failure Onset Paper Sutter Health
Doctor AI: Predicting Clinical Events via Recurrent Neural Networks Paper Sutter Health

Optimization

Topic Paper / Article / Video Company
How Trip Inferences and Machine Learning Optimize Delivery Times on Uber Eats -- Uber
Next-Generation Optimization for Dasher Dispatch at DoorDash -- DoorDash
Matchmaking in Lyft Line (Part 1)
(Part 2)
(Part 3)
-- Lyft
The Data and Science behind GrabShare Carpooling Help me in Updating Paper Grab

Validation and A/B Testing

Topic Paper / Article / Video Company
The Reusable Holdout: Preserving Validity in Adaptive Data Analysis Paper Google
Detecting Interference: An A/B Test of A/B Tests -- LinkedIn
Building Inclusive Products Through A/B Testing Paper LinkedIn
Experimenting to Solve Cramming -- Twitter
Announcing a New Framework for Designing Optimal Experiments with Pyro Paper
Paper
Uber
Enabling 10x More Experiments with Traveloka Experiment Platform -- Traveloka
Large scale experimentation at StitchFix Paper Stitch Fix
Modeling Conversion Rates and Saving Millions Using Kaplan-Meier and Gamma Distributions Code Better