IBM Data Analyst Capstone Project | Coursera |
Upon completing the final capstone project, participants will apply acquired Data Analytics techniques from preceding IBM Data Analyst Professional Certificate courses. They will simulate the role of an Associate Data Analyst within an organization, tackling a real-world business challenge necessitating comprehensive data analysis. Tasks encompassed include data gathering from diverse sources, data cleaning and preparation, exploratory data analysis, statistical examination and data mining, creation of data visualizations using various charts and plots, and development of interactive dashboards. The project concludes with a comprehensive data analysis report, featuring an executive summary, detailed analysis, and conclusions, serving as a demonstration of proficiency in tools such as Jupyter Notebooks, SQL, relational databases (RDBMS), Business Intelligence (BI) tools like Cognos, and Python libraries including Pandas, Numpy, Scikit-learn, Scipy, Matplotlib, Seaborn, among others. This project significantly enhances one's portfolio and presents a valuable opportunity to exhibit expertise in Data Analytics to potential employers.
Notice!
Always refer to the module on your course for the most accurate and up-to-date information.
Attention!
If you have any questions that are not covered in this post, please feel free to leave them in the comments section below. Thank you for your engagement.
Graded Quiz: Exploring Data
- 11525
- 11555
- 11552
- 11255
- 85
- 26
- 86
- 84
- Categorical
- Numerical
- Text
- Nominal
Graded Quiz: Duplicates
- 154
- 124
- 145
- 99
- 99
- 154
- 10
- 96
Graded Quiz: Removing Duplicates
- 9999
- 11398
- 13456
- 12333
- 11398
- 11342
- 11999
- 11000
Graded Quiz: Missing Values
- 0
- 112
- 231
- 280
- 188
- 0
- 201
- 280
Graded Quiz: Imputing Missing Values
- Retired
- Employed part-time
- Employed full-time
- Health Science
- Fine Arts
- Information Systems
- Computer Science
- median
- mean
- min
- max
- 7
- 3
- 11398
- 5
- 9999
- 312
- 6073
- 3259
- 8080
- 100000
- 6132520
- 9000
Graded Quiz: Distribution
- 29
- 28
- 31
- 30
- 57708
- 56453
- 54678
- 55467
- 24-32
- 40-48
- 32-40
- 16-24
Graded Quiz: Handling Outliers
- 55768
- 55342
- 78675
- 57745
- 53564
- 52704
- 56456
- 55672
- 0
- 2
- 3
- 1
- 10000
- 34567.34
- 2378.56
- 59883
Graded Quiz: Correlation
- WorkWeekHrs
- CompTotal
- Respondent
- CodeRevHrs
- CompTotal
- WorkWeekHrs
- ConvertedComp
- Respondent
Graded Quiz: Visualizing Relationship
- Very High negative correlation.
- No correlation
- Very high positive correlation
- Difficult to clearly identify from the scatter plot.
Graded Quiz: Visualizing Composition of Data
- 19.1
- 18.8
- 20.9
- 16.4
- 3
- 4
- 2
- 1
- 5132
- 5012
- 5321
- 7106
- 474
- 876
- 657
- 736
Graded Quiz: Visualizing Comparison of Data
- Developer - back-end
- Developer - full-stack
- Developer - mobile
- Developer - front-end
- Highly Negative
- No Correlation
- Negative
- Positive
Graded Quiz: Collecting Data
- Pandas
- Requests
- NumPy
- Matplotlib
- 1173
- 2089
- None of the above
- 4025
- Seattle
- New York
- Washington DC
- Log Angeles
Graded Quiz: Web Scraping
- C
- Java
- Swift
- C++
- soup.findall(“pic”)
- soup.findall(“img”)
- soup.find_all(“img”)
- soup.find_all(“image”)