Repository

Repository di mGalarnyk

Nearly 3 million rows of auto accidents in the USA over several years. I'm trying to do a barplot race....

Ultimo commit 22 apr 2020

 (2 star) (3 fork) (0 issue indicizzate) (0 good first issue aperte)

Homework/Classwork for my DSE 200 Python for Data Analysis Class at UC San Diego (UCSD)

Ultimo commit 4 ago 2016

 (102 star) (85 fork) (0 issue indicizzate) (0 good first issue aperte)

Database Management Systems Data Science Masters Course (DSE 201)

Ultimo commit 26 giu 2016

 (12 star) (9 fork) (0 issue indicizzate) (0 good first issue aperte)

Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)

Ultimo commit 21 ago 2017

 (181 star) (126 fork) (0 issue indicizzate) (0 good first issue aperte)

Repo for my graduate data science machine learning class at UCSD (UC San Diego). This course provides a broad introduction to the practical side of machine-learning and data analysis. The topics covered in this class include topics in supervised learning, such as k-nearest neighbor classifiers, decision trees, boosting and perceptrons, and topics in unsupervised learning, such as k-means, PCA and Gaussian mixture models.

Ultimo commit 26 mar 2018

 (54 star) (38 fork) (0 issue indicizzate) (0 good first issue aperte)

Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.

Ultimo commit 3 apr 2017

 (34 star) (22 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 6 ago 2021

 (24 star) (10 fork) (0 issue indicizzate) (0 good first issue aperte)

Interview stuff for friends

Ultimo commit 25 gen 2022

 (84 star) (63 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 10 mar 2026

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).

Ultimo commit 24 gen 2023

 (235 star) (163 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 11 feb 2025

 (0 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

Resources for my LinkedIn Learning Courses

Ultimo commit 18 lug 2023

 (1 star) (0 fork) (0 issue indicizzate) (0 good first issue aperte)

GitHub Repo for MGT-6090 Assignment 8 BHC.

Ultimo commit 15 nov 2023

 (0 star) (1 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 27 apr 2026

 (7 star) (3 fork) (0 issue indicizzate) (0 good first issue aperte)

Coursera machine learning specialization coursework (python based, University of Washington).

Ultimo commit 28 mar 2016

 (18 star) (20 fork) (0 issue indicizzate) (0 good first issue aperte)

Ultimo commit 26 giu 2016

 (3 star) (6 fork) (0 issue indicizzate) (0 good first issue aperte)

Python tutorials in both Jupyter Notebook and youtube format.

Ultimo commit 17 apr 2026

 (1256 star) (1133 fork) (0 issue indicizzate) (0 good first issue aperte)

Shingho is a PySpark based statistical library designed for Big Data applications.

Ultimo commit 17 apr 2017

 (1 star) (1 fork) (0 issue indicizzate) (0 good first issue aperte)

This is a repo to keep the data for my tutorials. This is to make it so people dont need a Kaggle account and such as much as possible.

Ultimo commit 15 feb 2026

 (9 star) (4 fork) (0 issue indicizzate) (0 good first issue aperte)

Legally allowable public portion of the UCSD Extension course: Data Analytics Using Python (CSE-41204)

Ultimo commit 15 set 2023

 (7 star) (4 fork) (0 issue indicizzate) (0 good first issue aperte)