Academic Catalog

DS 601 Fundamentals of Data Science

This course introduces the principles of data science field as well as review common functionalities of toolkits that are suitable for a data scientist portfolio, with emphasis on data sets cleaning, processing, merging, manipulating, as well as statistical concepts and test for significance in data. In particular, this course focuses on how to read in datasets into data structures, to query and index these structures, to merge multiple data structures, to summarize data into tables, to group data into logical categories, and to manipulate dates. The coverage also includes essentials of machine learning topics such as regression, classification and clustering as well as the importance of storytelling and data visualizations in data science process. This course is an intensive practice course using real-world datasets and variety of statistical methods for both data cleaning activities and compute statistical metrics for data analysis.

Credits

3