Repositories

mGalarnyk repositories

31 supported repositories

Nearly 3 million rows of auto accidents in the USA over several years. I'm trying to do a barplot race....

Last commit Apr 22, 2020

 (2 stars) (3 forks) (0 indexed issues) (0 open good first issues)

Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)

Last commit Aug 4, 2016

 (102 stars) (85 forks) (0 indexed issues) (0 open good first issues)

Database Management Systems Data Science Masters Course (DSE 201)

Last commit Jun 26, 2016

 (12 stars) (9 forks) (0 indexed issues) (0 open good first issues)

Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)

Last commit Aug 21, 2017

 (181 stars) (126 forks) (0 indexed issues) (0 open good first issues)

Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practical side of machine-learning and data analysis. The topics covered in this class include topics in supervised learning, such as k-nearest neighbor classifiers, decision trees, boosting and perceptrons, and topics in unsupervised learning, such as k-means, PCA and Gaussian mixture models.

Last commit Mar 26, 2018

 (54 stars) (38 forks) (0 indexed issues) (0 open good first issues)

Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.

Last commit Apr 3, 2017

 (34 stars) (22 forks) (0 indexed issues) (0 open good first issues)

Last commit Aug 6, 2021

 (24 stars) (10 forks) (0 indexed issues) (0 open good first issues)

Interview stuff for friends

Last commit Jan 25, 2022

 (84 stars) (63 forks) (0 indexed issues) (0 open good first issues)

Last commit Mar 10, 2026

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).

Last commit Jan 24, 2023

 (235 stars) (163 forks) (0 indexed issues) (0 open good first issues)

Last commit Feb 11, 2025

 (0 stars) (0 forks) (0 indexed issues) (0 open good first issues)

Resources for my LinkedIn Learning Courses

Last commit Jul 18, 2023

 (1 star) (0 forks) (0 indexed issues) (0 open good first issues)

GitHub Repo for MGT-6090 Assignment 8 BHC.

Last commit Nov 15, 2023

 (0 stars) (1 fork) (0 indexed issues) (0 open good first issues)

Last commit Apr 27, 2026

 (7 stars) (3 forks) (0 indexed issues) (0 open good first issues)

Coursera machine learning specialization coursework (python based, University of Washington).

Last commit Mar 28, 2016

 (18 stars) (20 forks) (0 indexed issues) (0 open good first issues)

Last commit Jun 26, 2016

 (3 stars) (6 forks) (0 indexed issues) (0 open good first issues)

Python tutorials in both Jupyter Notebook and youtube format.

Last commit Apr 17, 2026

 (1,256 stars) (1,133 forks) (0 indexed issues) (0 open good first issues)

Shingho is a PySpark based statistical library designed for Big Data applications.

Last commit Apr 17, 2017

 (1 star) (1 fork) (0 indexed issues) (0 open good first issues)

This is a repo to keep the data for my tutorials. This is to make it so people dont need a Kaggle account and such as much as possible.

Last commit Feb 15, 2026

 (9 stars) (4 forks) (0 indexed issues) (0 open good first issues)

Legally allowable public portion of the UCSD Extension course: Data Analytics Using Python (CSE-41204)

Last commit Sep 15, 2023

 (7 stars) (4 forks) (0 indexed issues) (0 open good first issues)