Danish Abbasi, Liulu Yue, Rongzi Xie, Wenxiao Wang
Project-long Course Project as part of COSI 116A: Information Visualization, taught by Prof. Dylan Cashman, Brandeis University.
Our project focuses on analyzing and visualizing ridership distribution for MBTA rails. Our motivation for tackling the problems addressed through our visualization is so that users can navigate the MBTA subway system more easily. This visualization showcases ridership distribution across time periods, directions, and rails. Using various encoding channels like area, color, and position, we've developed an interactive linked visualization.
Its aim is to assist users in understanding passenger flow when using any transportation line at different time periods. The link node chart enables users to explore passenger flow details for each line, while the line and bar charts offer different perspectives—one highlighting time sensitivity and the other providing an overview of passenger flow across rail lines.
See above in the demo video section for demonstration on our visualization.
The dataset used in this project is the MBTA Rail Ridership, available on the official MBTA website. This comprehensive dataset offers an intricate overview of ridership data encompassing all MBTA subway lines. It furnishes detailed information for each specific stop on every line, including the average count of passenger flow and more detailed information like ons and offs for each line. Our dataset contains mainly attribute data types, including both categorical and quantitative attribute types. Notably, it comprises data from various seasons, yet for the sake of consistency and to avoid pandemic-related disruptions, we specifically selected Fall 2019 data, representing the latest non-pandemic period available which we believe best mirrors the ridership status for current MBTA lines.
The steps for the data preprocessing were:
1. Downloaded raw data in csv from the web (MBTA_Rail_Ridership.csv) and deleted irrelevant columns that we didn’t want to get raw json file (data.json)
2. Grouped and calculated the average flow, ons, and offs based on the subway line and time period. The calculated data is saved into json file preprocessed_data.json
3. Load preprocessed_data.json into js file to be visualized.
All the data files we used in js are uploaded in the data folder. All the data processing code can be found in dataloader.js.
Our domain tasks for this project are as follows:
1. What is the estimated passenger flow distribution for each subway line at this time period of the day? (analytic task: summarize, search task: explore, analyze task: discover)
2. Utilizing a time table, how does the passenger flow distribution change throughout the different time periods of the day? (analytic task: summarize, search task: explore, analyze task: enjoy)
Goals for first domain task:
Goals for second domain task:
Early Sketches:
In the bar-graph sketch above, we chose to use lines for the marks and each bar(line) represents a subway line. The channels used are vertical lengths, horizontal positions, and colors. We chose to use vertical lengths so that the user could easily distinguish the passenger volume for each line and see which line has the least or greatest volume. Additionally, we chose horizontal positions to represent the different subway lines and colors to relate to the names of the subway lines.
In the line graph sketch above, we chose lines and points for the marks. We chose lines to represent each subway line and each point on the line to represent the different times of the day. Our design choice for this was so that the user could easily see the passenger flow across different time periods of the day easily. Our color decision aligns with our rationale for its usage in the bar-graph sketch.
Final Sketch:
From our previous beginning sketches, our design choices remained consistent with what we described previously within our final sketch drawings (shown above). We kept the same channel and marks for both the line and bar graph and our justifications for both remain the same.
Our visualization effectively portrays the passenger flow across each MBTA line during various times of the day. We achieved this through a linked timetable, line chart, and bar graph, enabling users to discern specific trends across the lines. For future enhancements, our project aims to incorporate all commuter rail lines and comprehensively detail all stations along each line.
List here where any code, packages/libraries, text, images, designs, etc. that you leverage come from.