Learning Goal: I'm working on a data analytics project and need support to help me learn.

Project Description

In this project, you'll create visualizations to reveal insights from a data set. You will create data visualizations that tell a story or highlight patterns in the data set. Your work should reflect the theory and practice of data visualization, such as visual encodings, design principles, and effective communication.

There are 3 different data sets you can choose from:
- Flight Delays and Cancellations
- US Census Demographic Data
- Youtube Data from the US

1) Flight Delays and Cancellations

This data comes from a Kaggle dataset that tracks the on-time performance of US domestic flights operated by large air carriers in 2015. You can find the dataset in the supporting materials at the bottom of this page.

The file you must use in creating your data visualizations is the flights.csv file. The other two provided files may be used in conjunction with the flights.csv file, but should not be used alone.

You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:

Which airlines or airports have the worst delays?
Determine which departure and arrival destinations have the most delays. Doing this using maps is actually pretty difficult, but you may choose an alternative visual to provide this information. Think about what kind of aggregates might work best to determine which airlines and airports are the best and worst in terms of delays.

What causes delays?
Imagine you work at an airline and want to decrease delays. What part of the flight causes the most delays? Do these causes vary by airport or time of year?

You can also come up with your own question!
As you work with the data, come up with a question that you're curious about and that can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.
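When deciding which aggregates best rank airlines or airports by delay, it can help to compare a few candidates on a small sample before building the view in Tableau. The sketch below is only an optional exploration aid, not part of the required submission. It assumes flights.csv has columns named AIRLINE and ARRIVAL_DELAY (check the Column Metadata on Kaggle), and it uses made-up rows in place of the real file.

```python
import csv
import io
from collections import defaultdict
from statistics import mean, median

# Made-up rows standing in for flights.csv. The column names AIRLINE and
# ARRIVAL_DELAY are assumptions; confirm them against the real header.
SAMPLE = """AIRLINE,ARRIVAL_DELAY
AA,10
AA,30
AA,
DL,-5
DL,5
UA,200
UA,0
UA,-8
"""

def delay_summary(rows, key="AIRLINE", value="ARRIVAL_DELAY"):
    """Group rows by `key` and summarize the numeric `value` column."""
    groups = defaultdict(list)
    for row in rows:
        raw = row.get(value)
        # Skip rows with no recorded delay (for example, cancelled flights).
        if raw is None or raw.strip() == "":
            continue
        groups[row[key]].append(float(raw))
    return {
        name: {
            "flights": len(delays),
            "mean": mean(delays),
            "median": median(delays),
        }
        for name, delays in groups.items()
    }

summary = delay_summary(csv.DictReader(io.StringIO(SAMPLE)))
for airline, stats in sorted(summary.items()):
    print(airline, stats)
```

In this toy sample a single 200-minute delay pulls one carrier's mean far above its median, which is the kind of outlier effect to keep in mind when choosing between average, median, or a share-of-flights-delayed measure.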
2) US Census Demographic Data

This data comes from a Kaggle dataset that includes the census data for all counties in 2015. You can find the dataset in the supporting materials at the bottom of this page.

Required dashboards

You are required to create three visualizations. Some questions you may attempt to answer include those pertaining to the following areas:

Which states have the best transportation?
This is a fairly subjective question, so your first job is to define what the best transportation is. Is it the highest percentage of transit use? Is it the lowest mean commute time? Then you need to determine how to aggregate the data from the county level to the state level. Are there outlier counties affecting the data? How should you aggregate all the data from the counties to represent the state effectively? Please provide your reasoning in your report.

How do income and poverty look across America?
Think about how best to contrast this data to show an interesting finding. You can look across many of the different fields to show interesting findings. Do counties with more construction experience more or less poverty? Do counties near the coast experience more or less income? Remember that this is all correlation and not causation, so we cannot say any one thing causes another, but we can report descriptive statistics.

You can also come up with your own question!
As you work with the data, come up with a question that you're curious about and that can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.

3) Youtube Data US

This data comes from a Kaggle dataset that includes a range of information for videos that were trending for at least one day. This data set will require some cleaning in Excel prior to Tableau. You can find the dataset in the supporting materials at the bottom of this page.

You are required to create three visualizations.
Some questions you may attempt to answer include those pertaining to the following areas:

What tags have grown in popularity over time?
The tag field is very hard to work with, so consider asking a different question. As a reminder, all of the questions here are suggestions; they are not required. There is an additional file with the tags transposed into a single column that you can make use of as well. To join it in, join on channel title and video title.
You will first need to figure out how to clean the data from having a bunch of tag words all in one cell to a format where you can count the number of times each individual tag word is used per day, month, or year, depending on how you are answering this question.

What categories are the most liked and disliked?
To answer this question you will need to replace the numeric category IDs with the actual names, which are in a separate file. Then you can look at this question more deeply. Perhaps there is a time of year when one category is preferred? Are there certain channels with an outlier amount of likes or dislikes?

You can also come up with your own question!
As you work with the data, come up with a question that you're curious about and that can be answered from the data. Build a dashboard or story to answer your question and lead viewers to that answer.

Requirements

Use the Project Rubric to review your project. If you are happy with your submission, then you are ready to submit! If you see room for improvement in any category in which you do not meet specifications, keep working!

Your project will be evaluated by a Udacity reviewer according to the same Project Rubric. Your project must "Meet Specifications" in each category in order for your submission to pass.

Reminders

- Your visualization work should use Tableau; other visualization tools may not be evaluated by reviewers.
- Host your dashboards/stories on Tableau Public. You'll be submitting them to reviewers, and this makes them easily accessible.
- Remember that your visualization should be explanatory in nature and communicate specific results that you want to show.

Submission

A PDF or Markdown report that includes the following sections:
- Links to your dashboards or story: you must submit URL links for each of your visuals from Tableau Public. If you need a reminder on how to save to Tableau Public, please see the next concept.
- Summary: brief description of the visualization and the main story or findings conveyed
- Design: explain any design choices you made, including changes to the visualization after collecting feedback
- Resources: list of Web sites, books, forums, blog posts, GitHub repositories, etc. that you referred to or used in this submission (add N/A if you did not use such resources)

Review the Column Metadata

Some of the columns you want to use in your project will have coded values that represent longer, more readable values. For instance, the cancellation_reason column in the flights data set has the values A, B, C, and D. These letters are not understandable by themselves.
You need to replace these letters with the full reason to make visualizations that include this data more readable.

These letters correspond to the following reasons:
A - Airline/Carrier
B - Weather
C - National Air System
D - Security

You should review the Column Metadata tab on Kaggle for each data set to find details about the data like the one outlined above.

- Flights link here: https://www.kaggle.com/usdot/flight-delays/data
- US Demographic data link here: https://www.kaggle.com/muonneutrino/us-census-demographic-data/data (click the second data link, which is the county file we are using for the project)
- Youtube US data link here: https://www.kaggle.com/datasnaek/youtube-new/data
  There is no column metadata for this file; reach out on Study Groups or Knowledge if you need assistance interpreting a column. The location data was added so that you could make use of the map visualization; treat this as the upload location for each video.

PROJECT SPECIFICATION

Telling Stories with Data

Visualization is Explanatory

Criteria: Does the visualization have a clear finding?
Meets specifications: The visualization centers on a specific, clear finding in the data.

Criteria: Does the visualization focus on its finding?
Meets specifications: The selected finding is clearly communicated. Design choices foster communication between the reader and the visualization. The visualization does not add additional colors, shapes, or other design elements in an unnecessary way.
Rather, each additional element should add to the insight being made.

Design

Criteria: Does the written summary reflect what a reader would interpret from the graphic?
Meets specifications: The written summary should include a brief description of the visualization and state at least one finding. A reader's summary of the graphic would closely match the written summary in the writeup, and a reader is able to identify at least one main point or relationship that the graphic attempts to convey.
To reiterate, your report should include at least 3 sets of:
- Link(s) to your dashboards or story
- Summary: brief description of the visualization and the main story or findings conveyed (please include an insight you are able to make from the visualization)
- Design: explain any design choices you made, including changes to the visualization after collecting feedback
- Resources: list of Web sites, books, forums, blog posts, GitHub repositories, etc. that you referred to or used in this submission (add N/A if you did not use such resources)

Criteria: Does the data visualization incorporate interaction or animation?
Meets specifications: The visualization includes interaction or animation. Filters and additional variables shown in tooltips are present where appropriate within the visualization.
At minimum, you are required to include a filter in one visualization and a tooltip in at least one visualization. You should strive to include these anywhere they would benefit your visualization.

Criteria: Are colors used appropriately?
Meets specifications: Color choices must accurately reflect the data and be chosen with accessibility in mind. For example, values that span from negative to positive numbers should be encoded with a diverging palette.
Also, the color palettes should work for colorblindness.

Criteria: Are the plots appropriate for the data types?
Meets specifications: Line plots for sequences, bar charts for categorical variables, etc.

Completeness

Criteria: The three visualizations are provided.
Meets specifications: The three visualizations are included. These visualizations may be a single worksheet, but at least one must be a dashboard involving more than one worksheet. A dashboard counts as a single visualization. All visualizations must be clearly connected to a finding, and include the interaction pieces (filters, colors, etc.) that allow a user to find that finding easily.
One dashboard is required. A dashboard is an option in Tableau that allows you to combine multiple charts into one page. This counts as 1 visualization.
Two other unique visualizations are also required. These can be two single worksheets, two more dashboards, two more stories, or any combination of worksheet, dashboard, or story.

Criteria: Are links to each of the dashboards hosted on Tableau Public provided?
Meets specifications: The visuals need to be saved to Tableau Public, and the links to those visuals must be provided in the report along with the finding for each visual.
If you are unable to save to Tableau Public, please include screenshots of each visualization in your PDF report. If you choose to use screenshots, you should include at least one screenshot of your filters being used (a before and after picture of the visualization).

Criteria: For each visual, there is a written response as to the insight(s) gained from the visual.
Meets specifications: The insight(s) should be accurate and easily available from the filters and interactivity available in the visual.

Criteria: Across your three visuals, you should incorporate at least 3 different types of visuals.
Meets specifications: Each visual must be appropriate for the particular data type. However, you cannot submit three bar charts or three line charts.
You should have at least three different types of visuals across all of your submitted items.

3 different types of charts are required. Here are some types you can choose:
- Bar Chart
- Line Chart
- Scatter Chart
- Histogram
- Bubble Chart
- Map
- Area Chart
- Pie Chart
- etc.
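The Column Metadata section of this brief asks for the coded cancellation_reason values to be replaced with their full reasons before they appear in a visual. As one possible way to do that during data preparation, the sketch below applies the A to D mapping given in the brief. The function name label_reason, the "Not cancelled" label for blank values, and the "Unknown" fallback are illustrative assumptions, not part of the assignment; the same recoding can also be done inside Tableau or Excel.

```python
# Mapping copied from the Column Metadata section of the brief.
CANCELLATION_REASONS = {
    "A": "Airline/Carrier",
    "B": "Weather",
    "C": "National Air System",
    "D": "Security",
}

def label_reason(code):
    """Return a readable label for a cancellation_reason code.

    Treating blank or missing values as "Not cancelled" is an assumption
    about how the data set marks flights that were not cancelled.
    """
    code = (code or "").strip().upper()
    if not code:
        return "Not cancelled"
    # Flag unexpected codes instead of silently dropping them.
    return CANCELLATION_REASONS.get(code, f"Unknown ({code})")

labels = [label_reason(c) for c in ["A", "b", "", None, "Z"]]
print(labels)
```

Keeping unexpected codes visible as "Unknown (...)" makes data-quality problems show up in the chart rather than disappear from it.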
