ML Project 1 (Post 4)

Wow, this was a tough one!! I actually had the right idea for this Gaussian Bayes Classifier from the start, but I got totally stuck because my Gaussian values were coming out as 2×2 matrixes instead of probabilities. It turns out my x-mu vectors were being stored as arrays, not matrixes, and in Python, arrays don’t have the same shape. So I don’t know why it didn’t get an error, but the math was coming out all wonky.

I stepped through each piece of that equation, and eventually discovered the “shape” property, which showed me that what I thought was a matrix and a matrix transpose were being seen as the same shape.

ML Project 1 (Post 3)

I present to you… my first classifier! Naive Bayes! It appears to work! haha :)
I know it’s a mess, but I have barely used Python before, and I’m new to Machine Learning, so I’m learning. This is for #2 for my project.

Goals

Since this site is about “becoming a data scientist”, I thought I should define what I think that means by setting some goals for myself.

What is a Data Scientist?

It’s pretty clear based on the variety of (sometimes conflicting) information out there that the definition of “data scientist” is still being developed. Data Science is basically a new trendy term for a collection of things that has already existed: data modeling, data mining, statistical analysis, predictive analytics, machine learning, etc.

Published
Categorized as resources

Why Data Science?

TL;DR: I like it, it matches up well with my existing skills, I think I’ll be good at it, and I can modify my current graduate degree to better prepare myself for a career in data science.