Post on 22-Dec-2015


Last Minute Vacation

Taylor McGann

Danny Christensen

Russ Taylor

Agenda

Introduction

Data Collection

Data Summarization

Models

Conclusions

Future Research

Last minute vacation?...What?

How can we get a cheap ticket?

1. What is the best day to purchase tickets?

2. How far in advance should we purchase tickets?

3. Are we getting the best price?

Ruby

Selenium WebDriver

Kayak

Loads of CSV files
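The slides do not show how the per-download CSV files were merged into one data set. The following is a minimal sketch of one way to do it with Ruby's standard `csv` library. The helper name `combine_csv_files` and the assumption that every file carries a header row are illustrative, not taken from the original project.

```ruby
require "csv"

# Merge rows from several per-download CSV files into one array of hashes.
# Assumption: every file has a header row using the column names from the
# data dictionary (e.g. "Airline", "Price"). The helper name is hypothetical.
def combine_csv_files(paths)
  paths.flat_map do |path|
    # CSV.read with headers: true returns a table of rows keyed by header.
    CSV.read(path, headers: true).map(&:to_h)
  end
end
```

It could be called as `combine_csv_files(Dir.glob("data/*.csv"))`, where the `data/` directory is a placeholder. All values come back as strings, so numeric columns such as Price still need converting.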

"We don't want bots running about trying to book airline tickets. They tend to try to cram large suitcases in the overhead bin, and they prattle on about celebrities they know while you are trying to watch the movie."

How about that data?

Name            Type      Description
Airline         Text      Airline code. For example, AA is American Airlines.
Arrive          Text      Arrival city as a three-letter code.
Arrive Date     Date      Arrival date.
Arrive Time     Integer   Flight arrival time in military (24-hour) time.
Class           Text      Class of a flight: business, coach, mixed, premium, or first.
Depart          Text      Departure city as a three-letter code.
Depart Date     Date      Departure date.
Depart Time     Integer   Flight departure time in military (24-hour) time.
Departure Day   Text      Day of the week a flight departs.
Difference      Integer   Number of days between the download date and the departure date.
Download Date   Date      Download date, i.e. the date a flight would have been purchased.
Download Day    Text      Day of the week flight data was downloaded.
Duration        Integer   Number of minutes in flight.
Equipment       Text      Type of plane.
Flight          Integer   Flight number.
Price           Integer   Total cost for a flight.
Record          Integer   Record number of each item within an individual CSV. In the combined data set this number is meaningless.
Stops           Integer   Total number of stops or layovers for a flight.
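Three of the fields above (Difference, Download Day, Departure Day) can be derived from the two dates. A minimal sketch follows, assuming the dates are ISO 8601 strings. The slides do not state the stored date format, and the helper name `derived_fields` is hypothetical.

```ruby
require "date"

# Derive the calendar-based fields from the download and departure dates.
# Assumption: both dates are ISO 8601 strings such as "2015-11-03".
def derived_fields(download_date, depart_date)
  download = Date.iso8601(download_date)
  depart   = Date.iso8601(depart_date)
  {
    "Difference"    => (depart - download).to_i, # whole days until departure
    "Download Day"  => download.strftime("%A"),  # full weekday name
    "Departure Day" => depart.strftime("%A")
  }
end
```

For example, a fare downloaded on 2015-11-03 for a 2015-12-25 departure has a Difference of 52 days.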

Name            Type      Description
Airline         Text      Airline code. For example, AA is American Airlines.
Arrive          Text      Arrival city as a three-letter code.
Arrive Time     Integer   Flight arrival time in military (24-hour) time.
Class           Text      Class of a flight: business, coach, mixed, premium, or first.
Departure Day   Text      Day of the week a flight departs.
Difference      Integer   Number of days between the download date and the departure date.
Download Day    Text      Day of the week flight data was downloaded.
Duration        Integer   Number of minutes in flight.
Price           Integer   Total cost for a flight.
Stops           Integer   Total number of stops or layovers for a flight.

Predicting Price

Final Variables

ANN

KNN
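The KNN slides survive only as titles, so the following is a generic sketch of k-nearest-neighbors regression rather than the presenters' actual model. It assumes each training row is a pair of numeric features and a price, and that the features are already on comparable scales. The function name and the choice of Euclidean distance are assumptions.

```ruby
# k-nearest-neighbors regression: predict a price as the mean price of the
# k training rows closest to the query point (Euclidean distance).
# Assumption: train is an array of [features, price] pairs with numeric,
# comparably scaled features. Names here are illustrative only.
def knn_predict(train, query, k)
  nearest = train.min_by(k) do |features, _price|
    Math.sqrt(features.zip(query).sum { |a, b| (a - b)**2 })
  end
  # Divide by the number of neighbors actually found, in case k > train.size.
  nearest.sum { |_features, price| price } / nearest.size.to_f
end
```

With features such as `[Difference, Stops]`, a query is answered by averaging the prices of its closest recorded flights. Text fields like Airline or Class would first need a numeric encoding.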

Linear Regression

*NOTE: Table does not include all variables
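The regression table itself did not survive in the transcript. As an illustration of the technique only, here is ordinary least squares with a single predictor, for example price against Difference. The presenters' model used more variables, and the function name `fit_line` is hypothetical.

```ruby
# Ordinary least squares with one predictor: y = intercept + slope * x.
# Simplification: the slides imply a model with several variables; this
# sketch fits a single one to show the mechanics.
def fit_line(xs, ys)
  n = xs.size.to_f
  mean_x = xs.sum / n
  mean_y = ys.sum / n
  # Slope is the covariance of x and y divided by the variance of x.
  sxy = xs.zip(ys).sum { |x, y| (x - mean_x) * (y - mean_y) }
  sxx = xs.sum { |x| (x - mean_x)**2 }
  slope = sxy / sxx
  { slope: slope, intercept: mean_y - slope * mean_x }
end
```

A positive slope on Difference would suggest prices rise the further ahead a ticket is bought, and a negative slope the opposite, within the range of days actually observed.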

1. What is the best day to purchase tickets?

Tuesday Sunday
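One simple way to compare purchase days is to average Price per Download Day. The sketch below assumes rows are hashes keyed by the column names in the data dictionary. The helper name `mean_price_by` is hypothetical, and the slides do not say which summary the presenters used.

```ruby
# Average price per distinct value of `key`, sorted cheapest first.
# Assumption: each row is a hash with a "Price" entry that converts to a number.
def mean_price_by(rows, key)
  rows.group_by { |row| row[key] }
      .map { |value, group| [value, group.sum { |r| r["Price"].to_f } / group.size] }
      .sort_by { |_value, mean| mean }
end
```

Calling `mean_price_by(rows, "Download Day")` ranks weekdays by average fare. The same helper with `"Difference"` as the key gives the average fare by days in advance.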

Conclusions

2. How far in advance should we purchase tickets?

Conclusions

1 Day

60 Days

120 Days

180 Days

3. Are we getting the best price?

Conclusions

Hard to determine.

Too many variables.

Out of scope.

Future Research

Explore variables in isolation (e.g., destination, airline).

Add a "Buy/Buy Not" variable to do categorical analysis.
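The slides do not define how such a variable would be assigned. Purely as an illustration, the sketch below labels a fare "Buy" when it is within a tolerance of the cheapest fare observed in its group. The rule, the 5% default, and the helper name `label_buy` are all assumptions, not the presenters' proposal.

```ruby
# Hypothetical labeling rule (not from the slides): mark a row "Buy" when its
# price is within `tolerance` (a fraction) of the cheapest price in its group,
# and "Buy Not" otherwise. Rows are grouped by the given column names.
def label_buy(rows, group_keys, tolerance = 0.05)
  rows.group_by { |row| row.values_at(*group_keys) }.flat_map do |_key, group|
    floor = group.map { |r| r["Price"].to_f }.min
    group.map do |r|
      buy = r["Price"].to_f <= floor * (1 + tolerance)
      r.merge("Buy" => buy ? "Buy" : "Buy Not")
    end
  end
end
```

Grouping by `["Depart", "Arrive"]`, for instance, compares each fare only against other fares on the same route. The resulting label could then serve as the target for a classifier.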

Collect data over a longer period of time.

Isolate seasonal and cyclical trends.

Questions
