Nearly 3 million rows of auto accidents in the USA over several years. I'm trying to do a barplot race....
Repositories
mGalarnyk repositories
Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)
Database Management Systems Data Science Masters Course (DSE 201)
Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)
Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practical side of machine-learning and data analysis. The topics covered in this class include topics in supervised learning, such as k-nearest neighbor classifiers, decision trees, boosting and perceptrons, and topics in unsupervised learning, such as k-means, PCA and Gaussian mixture models.
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
Interview stuff for friends
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Resources for my LinkedIn Learning Courses
GitHub Repo for MGT-6090 Assignment 8 BHC.
Coursera machine learning specialization coursework (python based, University of Washington).
Python tutorials in both Jupyter Notebook and youtube format.
Shingho is a PySpark based statistical library designed for Big Data applications.
This is a repo to keep the data for my tutorials. This is to make it so people dont need a Kaggle account and such as much as possible.
Legally allowable public portion of the UCSD Extension course: Data Analytics Using Python (CSE-41204)