SlideShare a Scribd company logo
PredictionIO
Adam Wang
Annie Cheng
Brett Clancy
Erin Chen
Han-Wen Chen
Siyuan Wang
Yiting Wang
Koober–
Koober
Meet Tim
Koober
Tim’s Challenge
+
Provide an efficient service for customers and taxi
drivers by…
Wait Times Revenue
Tim has some questions
How should taxis be allocated
among different locations?
Koober
What factors drive customers
to use taxis more?
But most importantly…
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Current Solutions
Koober
iTaxi
Shiny
TaxiPrediction
Our Solution
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Koober
Why Koober?
Demand Analysis + Prediction. Empower taxi
owners with valuable rider information backed by
past trends.
Koober
Anyone in Ride Sharing Industry. Plan more
appropriate routes and make better transportation
decisions.
So how does it work?
Koober
1. Choose your info need: analysis or prediction.
OR
So how does it work?
Koober
2. Adjust your input query parameters.
So how does it work?
Koober
3. Visualize taxi demand on the heat maps.
So how does it work?
Koober
4. Make more informed taxi resource decisions.
Let’s try it out!
https://koober-dashboard.herokuapp.com/
Koober
How does it really work?
Koober
Train and test machine learning models.
Display demand on interactive heat maps.
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Raw Data
NYC Taxi & Limousine Commission:
Yellow Taxi Cab Dataset
Koober
Weather Underground API:
• Historical Weather Data
• Current Conditions
• 10-Day Forecast
Task 1
Demo Data Loader
Koober
Task 1
Development Production
• Generate fake traffic data
based on user-defined
parameters
• Simplify debugging and
validating the model
prediction result
• Process NYC Taxi Data and
integrate with the historical
weather data
• Incorporate Kafka to
facilitate importing large
amount of training data
Feature Extraction
Koober
Task 1
{
   "eventTime": "2017-01-20T18:54:07.000-05:00",
   "lat": 40.713802337646484,
   "lng": -77.0088882446289,
   "temperature": 0,
   "clear": 1,
   "fog": 0,
   "rain": 0,
   "snow": 0,
   "hail": 0,
   "thunder": 0,
   "tornado": 0
}
Weather
Location
Event Time
Temperature
Location Clustering
Koober
Preparator:
• K-Means Location Clustering
(200 clusters)
• Each cluster represents a
neighborhood geographically
• Demand Aggregation per Unit
Time Interval and Location
Cluster
Task 1
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Koober
Task 2
Gradient-Boosted Trees
Linear Regression with Stochastic Gradient Descent
Neural Network
Random Forest
Ridge Regression
Multiple Models. Supports many classic individual
and combined machine learning models.
Models
Koober
Task 2
PredictionIO - Train
Koober
Task 2
PredictionIO - Predict
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
What have we done?
Koober
Task 1: Generate dataset and extract features.
Task 2: Build various machine learning models.
Task 3: Develop the website and dashboard interface.
Task 4: Integrate Mapbox data visualization.
Koober
Task 4
User
Interface
Time
Location
Weather Predict Query
Predict Result
for Each
Algorithm
Map GL
component
Mapbox PredictionIO
Engine
Web App
Map Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Koober
How can the taxi industry utilize past data
to predict the future needs of their riders?
Looking Back
Koober. An open-source interactive website for
visualizing past and predicting future taxi demand.
Website Data
Machine
Learning
Visualization
Looking Forward
Koober
ModelsTraining Data Prediction Modes
More Cities
Uber/Lyft
Better Tuning
New Models
Wait Times
Prices
Our Mentor
Engineering and Open Source Ambassador at
Salesforce.com
Koober
James Ward
www.jamesward.com
Our Professor
Assistant Professor in Computer Science at
Cornell University
Koober
Ross Tate
http://www.cs.cornell.edu/~ross/
Contribute at
Koober
https://github.com/jamesward/
koober
Questions?
Koober

More Related Content

Koober Machine Learning

  • 1. PredictionIO Adam Wang Annie Cheng Brett Clancy Erin Chen Han-Wen Chen Siyuan Wang Yiting Wang Koober–
  • 2. Koober Meet Tim
  • 3. Koober Tim’s Challenge + Provide an efficient service for customers and taxi drivers by… Wait Times Revenue
  • 4. Tim has some questions How should taxis be allocated among different locations? Koober What factors drive customers to use taxis more?
  • 5. But most importantly… Koober How can the taxi industry utilize past data to predict the future needs of their riders?
  • 6. Current Solutions Koober iTaxi Shiny TaxiPrediction
  • 7. Our Solution Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Koober
  • 8. Why Koober? Demand Analysis + Prediction. Empower taxi owners with valuable rider information backed by past trends. Koober Anyone in Ride Sharing Industry. Plan more appropriate routes and make better transportation decisions.
  • 9. So how does it work? Koober 1. Choose your info need: analysis or prediction. OR
  • 10. So how does it work? Koober 2. Adjust your input query parameters.
  • 11. So how does it work? Koober 3. Visualize taxi demand on the heat maps.
  • 12. So how does it work? Koober 4. Make more informed taxi resource decisions.
  • 13. Let’s try it out! https://koober-dashboard.herokuapp.com/ Koober
  • 14. How does it really work? Koober Train and test machine learning models. Display demand on interactive heat maps.
  • 15. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 16. Raw Data NYC Taxi & Limousine Commission: Yellow Taxi Cab Dataset Koober Weather Underground API: • Historical Weather Data • Current Conditions • 10-Day Forecast Task 1
  • 17. Demo Data Loader Koober Task 1 Development Production • Generate fake traffic data based on user-defined parameters • Simplify debugging and validating the model prediction result • Process NYC Taxi Data and integrate with the historical weather data • Incorporate Kafka to facilitate importing large amount of training data
  • 18. Feature Extraction Koober Task 1 {    "eventTime": "2017-01-20T18:54:07.000-05:00",    "lat": 40.713802337646484,    "lng": -77.0088882446289,    "temperature": 0,    "clear": 1,    "fog": 0,    "rain": 0,    "snow": 0,    "hail": 0,    "thunder": 0,    "tornado": 0 } Weather Location Event Time Temperature
  • 19. Location Clustering Koober Preparator: • K-Means Location Clustering (200 clusters) • Each cluster represents a neighborhood geographically • Demand Aggregation per Unit Time Interval and Location Cluster Task 1
  • 20. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 21. Koober Task 2 Gradient-Boosted Trees Linear Regression with Stochastic Gradient Descent Neural Network Random Forest Ridge Regression Multiple Models. Supports many classic individual and combined machine learning models. Models
  • 22. Koober Task 2 PredictionIO - Train
  • 23. Koober Task 2 PredictionIO - Predict
  • 24. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 25. What have we done? Koober Task 1: Generate dataset and extract features. Task 2: Build various machine learning models. Task 3: Develop the website and dashboard interface. Task 4: Integrate Mapbox data visualization.
  • 26. Koober Task 4 User Interface Time Location Weather Predict Query Predict Result for Each Algorithm Map GL component Mapbox PredictionIO Engine Web App Map Visualization
  • 27. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 28. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 29. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 30. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 31. Koober How can the taxi industry utilize past data to predict the future needs of their riders? Looking Back Koober. An open-source interactive website for visualizing past and predicting future taxi demand. Website Data Machine Learning Visualization
  • 32. Looking Forward Koober ModelsTraining Data Prediction Modes More Cities Uber/Lyft Better Tuning New Models Wait Times Prices
  • 33. Our Mentor Engineering and Open Source Ambassador at Salesforce.com Koober James Ward www.jamesward.com
  • 34. Our Professor Assistant Professor in Computer Science at Cornell University Koober Ross Tate http://www.cs.cornell.edu/~ross/
  • 35. Contribute at Koober https://github.com/jamesward/ koober
  • 36. Questions? Koober
© 2024 SlideShare from Scribd

玻璃钢生产厂家玻璃钢动物雕塑 驴龙亭玻璃钢雕塑价格河源玻璃钢雕塑制品玻璃钢雕塑制作工艺有几种潮州玻璃钢动物雕塑图片南充商场美陈知名玻璃钢雕塑规格尺寸新民玻璃钢花盆花器广场标识校园玻璃钢雕塑制造雨花台玻璃钢雕塑公司玻璃钢花盆长兴玻璃钢雕塑打磨技巧昭通市玻璃钢雕塑设计多少钱浦口商场装饰美陈盐城商场美陈策划大型商场美陈方案哈尔滨校园玻璃钢雕塑深圳坪地镇玻璃钢雕塑厂招聘玻璃钢佛像雕塑报价内乡玻璃钢雕塑设计大型玻璃钢雕塑销售价格湖北医院玻璃钢雕塑阜阳大型户外玻璃钢雕塑厂家商场夏季美陈布置武汉肖像校园玻璃钢雕塑厂家玻璃钢仿铜啤酒厂人物雕塑张家口玻璃钢仿铜雕塑河池玻璃钢卡通雕塑上饶玻璃钢马雕塑三月商场美陈香港通过《维护国家安全条例》两大学生合买彩票中奖一人不认账让美丽中国“从细节出发”19岁小伙救下5人后溺亡 多方发声单亲妈妈陷入热恋 14岁儿子报警汪小菲曝离婚始末遭遇山火的松茸之乡雅江山火三名扑火人员牺牲系谣言何赛飞追着代拍打萧美琴窜访捷克 外交部回应卫健委通报少年有偿捐血浆16次猝死手机成瘾是影响睡眠质量重要因素高校汽车撞人致3死16伤 司机系学生315晚会后胖东来又人满为患了小米汽车超级工厂正式揭幕中国拥有亿元资产的家庭达13.3万户周杰伦一审败诉网易男孩8年未见母亲被告知被遗忘许家印被限制高消费饲养员用铁锨驱打大熊猫被辞退男子被猫抓伤后确诊“猫抓病”特朗普无法缴纳4.54亿美元罚金倪萍分享减重40斤方法联合利华开始重组张家界的山上“长”满了韩国人?张立群任西安交通大学校长杨倩无缘巴黎奥运“重生之我在北大当嫡校长”黑马情侣提车了专访95后高颜值猪保姆考生莫言也上北大硕士复试名单了网友洛杉矶偶遇贾玲专家建议不必谈骨泥色变沉迷短剧的人就像掉进了杀猪盘奥巴马现身唐宁街 黑色着装引猜测七年后宇文玥被薅头发捞上岸事业单位女子向同事水杯投不明物质凯特王妃现身!外出购物视频曝光河南驻马店通报西平中学跳楼事件王树国卸任西安交大校长 师生送别恒大被罚41.75亿到底怎么缴男子被流浪猫绊倒 投喂者赔24万房客欠租失踪 房东直发愁西双版纳热带植物园回应蜉蝣大爆发钱人豪晒法院裁定实锤抄袭外国人感慨凌晨的中国很安全胖东来员工每周单休无小长假白宫:哈马斯三号人物被杀测试车高速逃费 小米:已补缴老人退休金被冒领16年 金额超20万

玻璃钢生产厂家 XML地图 TXT地图 虚拟主机 SEO 网站制作 网站优化