[Full Name]
[SNHU Email]
Southern New Hampshire University
Note: Replace the bracketed text on page one (the cover page) with your personal information.
Introduction: Problem Statement
Discuss the statement of the problem in terms of the statistical analyses that are being performed. In your response, you should address the following questions:
· What is the problem you are going to solve?
· What data set are you using?
· What statistical methods will you be using to do the analysis for this project?
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report.
Introduction: Your Team and the Assigned Team
In this project, you picked a team and you were assigned a team to do comparative analysis.
See Steps 1 and 2 in the Python script to address the following items:
· What team did you pick and what years were picked to do the analysis?
· What team and range of years were you assigned for the comparative study? (Hint: This is called the assigned team in the Python script.) Present this information in a formatted table as shown below.
Table 1. Information on the Teams
Name of Team
Assigned Years
1. Yours
Team (e.g. Knicks)
XXXX-YYYY (e.g. 2013 - 2015)
2. Assigned
Team (e.g. Bulls)
XXXX-YYYY (e.g. 2013 - 2015)
Answer the questions in a paragraph response. Remove all questions and this note (but not the table) before submitting! Do not include Python code in your report.
Data Visualization: Points Scored by Your Team
In the Python script, you created a visualization for the distribution of points scored by your team.
See Step 3 in the Python script to address the following items in a paragraph response:
· In general, how is data visualization used to study data distributions and trends?
· In this activity, you were asked to pick one of the two plots that best describes the data distribution of the variable for your team. Include a screenshot of this plot in your report.
· Why did you pick this plot? Explain.
· What can you say about the distribution of the variable by visually inspecting this plot? What does this signify?
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report.
Data Visualization: Points Scored by the Assigned Team
In the Python script, you created a visualization for the distribution of points scored by the assigned team.
See Step 4 in the Python script to address the following items in a paragraph response:
· In this activity, you were asked to pick one of the two plots that best describes the data distribution of the variable for the assigned team. Include this plot in your report.
· Why did you pick this plot? Explain.
· What can you say about the distribution of the variable by visually inspecting this plot? What does this signify?
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report.
Data Visualization: Comparing the Two Teams
In the Python script, you created a visualization for the difference in the distributions of points scored by your team and the assigned team.
See Step 5 in the Python script to address the following items in a paragraph response:
· In general, how is data visualization used to compare two different data distributions?
· In this activity, you were asked to pick one of the two plots that best compares the data distributions of your team with the assigned team. Include a screenshot of this plot in your report.
· Why did you pick this plot? Explain.
· How do the two distributions compare to each other?
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report.
Descriptive Statistics: Relative Skill of Your Team
In the Python script, you calculated descriptive statistics on the relative skill of your team. These included the mean, median, variance, and standard deviation for the relative skill of your team.
See Step 6 in the Python script to address the following items:
· Summarize all statistics in a formatted table as shown below. Use one row for each statistic. You will need to add rows to the table in order to include all of your statistics.
Table 2. Descriptive Statistics for Relative Skill of Your Team
Statistic Name
Value
Statistic (for example, Mean)
X.XX
*Round off to 2 decimal places.
· In general, how are the measures of central tendency and variability used to analyze a data distribution?
· Interpret each statistic in detail and explain what it represents in this scenario.
· Use the mean and the median to describe the distribution of relative skill of your team.
· Describe the skew: Is it left, right, or bell-shaped?
· Explain which measure of central tendency is best to use to represent the center of the distribution based on its skew.
Answer the questions in a paragraph response. Remove all questions and this note (but not the table) before submitting! Do not include Python code in your report.
Descriptive Statistics: Relative Skill of the Assigned Team
In the Python script, you calculated descriptive statistics on the relative skill of the assigned team. These included the mean, median, variance, and standard deviation for the relative skill of the assigned team.
See Step 7 in the Python script to address the following items:
· Summarize all statistics in a formatted table as shown below. Use one row for each statistic. You will need to add rows to the table in order to include all of your statistics.
Table 3. Descriptive Statistics for Relative Skill of the Assigned Team
Statistic Name
Value
Statistic (for example, Mean)
X.XX
*Round off to 2 decimal places.
· Interpret each statistic in detail and explain what it represents in this scenario.
· Use the mean and the median to describe the distribution of relative skill of the assigned team.
a. Describe the skew: Is it left, right, or bell-shaped?
b. Explain which measure of central tendency is best to use to represent the center of the distribution based on its skew.
· Use the variance and the standard deviation to compare the distributions of relative skill of your team and relative skill of the assigned team. Which of the two teams has a more consistent skill? How do you know?