Air pollution has grown in New York as a result of industrialization and growing urbanization. As the respiratory system is the first organ affected negatively by air pollution, exposure to high levels of pollutants can make respiratory illnesses worse. Children in New York State had an even greater frequency, with an 8.7% prevalence in 2014 rising to 10.0% in 2015, with the largest burden in New York City. According to previous studies on asthma, air pollution may make the condition more common. Visually displaying asthma (asthma hospitalization, asthma emergency department visits, and asthma death rate) by county among residents of New York in 2020 can offer insight for interventions and policies seeking to address air pollution hazards (fine particulate matter (PM2.5) and ozone (O3)) affecting public health in the United States.
Data Source: Data on air pollutants, asthma, and population
in 2020 were all sourced from the EPA Air Data, New York State
Department of Health, and the United States’ Census Bureau respectively.
Methods: Asthma and air pollutant data by county level
were collected and joined using QGIS software.
* Air pollution parameters were considered independent modeling
variables, using spatial interpolation and zonal statistics. * Spatial
clusters were mapped and analyzed in QGIS to visualize results. Second,
I linked air pollutant mean concentration to asthma rate by county
level. * Asthma disadvantage index was conducted to adjust three asthma
rates; then using Geoda run the regression analysis. * Bivariate Local
Moran’s I analyses were run using Geoda software to determine clustering
in asthma death rate.
Most of us have experienced waiting at the airport for the delayed flight. As the holiday season approaches, departure on-time have becoming more and more challenging. This year, there are already more than 4,000 flights delayed as holiday travel spikes in the U.S. The flight delay issue especially concerns busy metropolitan airports like JFK.
Using JFK departed flights as an example, we’re interested in exploring the factors that are potentially related with flight delay (e.g., airlines, weather, and COVID conditions). By doing so, we’d like to give the holiday travelers a better sense of what to expect regarding flight delay for this holiday season, in order to make better travel plans accordingly.
As in the draft project proposal, we would like to do some reviews on the three airports serving New York City, i.e., JFK, LGA, and EWR. The initial questions we would like to focus on included the comparisons among these three airports, such as restaurants, shops and stores, lounges, and other facilities/services.
However, considering the availability and scale of the accessible and available data, we switched our main topic to focus on JFK Airport’s delay and cancellation data from 11/1/2021 to 1/31/2022. And the weather and COVID cases data within that time range were accordingly. Based on these information, our final questions are below:
We used three primary datasets for exploration, visualization, and statistical analysis, and an additional supplemental dataset for interactive mapping.
Flight delay and cancellation data for JFK departures was obtained from the Bureau of Transportation Statistics (BTS). We first wrote a function to iterate on the reading-in process for each airline of interest from November 2021 to January 2022. Some new variable were created for subsequent analysis use, including scheduled hours for flight’s departures, year, month, day, etc. Delay minutes were manually calculated as the time difference in actual departure time and scheduled departure time. Flights with actual elapsed time minutes equal to 0 were treated as cancellation. Air carriers were recoded as airline names to be more clear.
Hourly and daily weather information from 11/1/2021 to 1/31/2022 was obtained from the National Oceanic and Atmospheric Administration (NOAA). We selected the specific zip code for JFK and requested the raw data thereof. To align the time unit of weather with flight information, considering data availability, we picked the weather at the 51th minute to represent the weather of the hour, and the weather at 23:59 to represent the weather condition of the day. Only date and hourly/daily weather condition of interest were kept in the resulted tidied dataset.
Daily COVID cases count from 11/1/2021 to 1/31/2022 was obtained, by API, from NYC OpenData, provided by the NYC Department of Health and Mental Hygiene (DOHMH). The resulted dataset included date, year, month, day, and daily case counts.
An additional source for the interactive map was U.S. domestic airport location information from the Humanitarian Data Exchange, where we extracted the latitude and longitude for those airports with delay and/or cancellation records.
Check it on website