Xiaorui (Jeremy) Zhu
01/25/2026
The aircraft.xlsx dataset contains columns for MD (United States Air Force (USAF) ), FH (Flight Hours), FY (Fiscal Year), Cost, and Gallons.
## [1] "Bombers" "Fighters" "Trainers"
## [4] "UAV_Drones" "Tankers_Transporters"
## # A tibble: 6 × 6
## Type MD FY FH Gallons Cost
## <chr> <chr> <dbl> <dbl> <dbl> <dbl>
## 1 Trainer AT-38 1996 12517 6681614 5641569
## 2 Trainer AT-38 1997 11656 7707001 6506680
## 3 Trainer AT-38 1998 12619 9749881 9526089
## 4 Trainer AT-38 1999 13132 10534024 9343636
## 5 Trainer AT-38 2000 14400 10769237 7242603
## 6 Trainer AT-38 2001 12674 9680191 10533477
Look at the summary information for this data. Anything seem odd?
Which aircraft MDs are represented?
Are there any missing years between 1996-2014 in this data?
90% of flying hours fall under what value?
What is the spread of the range of costs?
How would you describe the distribution of flying hours?
If we wanted to focus on only the trainers with the largest variance in flying hours, which MDs would we select?
Are all FYs equally represented?