I finally had a chance this weekend to make some progress on my “Data Science Directory” website, DataSciGuide.com, and I would love your feedback on it! That site isn’t open for comments yet, so I’m directing people to leave feedback here. If you haven’t kept up with the development of DataSciGuide, here are a few… Continue reading DataSciGuide Update
Author: Renee
The Imitation Game, and the Human Element in Data Science
Last night, my husband and I watched The Imitation Game. First of all, it’s a great movie and you should see it. Secondly, there was a moment that got me thinking about the human element of machine learning.
[Spoiler Alerts – but you probably already know much of the story, and the movie is still good even if you know the historical outcome.]
I thought a moment like this may be coming when Alan Turing was first applying to work at Bletchley Park, and Denniston can’t believe he’s applying to be a Nazi codebreaker without even knowing how to speak German. Alan emphasizes that he is masterful at games and solving puzzles, and that the Nazi Enigma machine is a puzzle he wants to solve. He starts designing and building a machine that will theoretically be able to decode the Nazi radio transmissions, but the decoder settings change every day at 12am, so the machine must solve for the settings before the stroke of midnight every day in order for the day’s messages to be decoded in time to be useful and not interfere with the next day’s decoding process. Turing can’t prove his machine will work, simply because it is simply taking too long to solve the daily puzzle. In the meantime, people are dying in the war, and the Nazis are going on transmitting their messages over normal radio waves believing the code is “unbreakable”.
My “Secret” Side Project, Revealed
OK So I was actually hoping to show this to you all long ago, and I kept coming up with more and more ideas for it, so it’s not going to be “ready” to reveal for a while, but I figured I’d go ahead and show it to you anyway. My main motivation is that… Continue reading My “Secret” Side Project, Revealed
Entry Level Data Analyst Skills
Between an interview from a local TV station about my job and going through the process of hiring someone onto our team, I’ve been thinking about what would be the bare minimum skills someone would need to have a chance at being hired as a data analyst. Maybe this would be a helpful list for… Continue reading Entry Level Data Analyst Skills
The Data Science Central “Incident”
I’m writing this post to respond both to what many of you saw Vincent Granville said about me on Facebook a couple days ago, which was brought to my attention yesterday: (in context) and to his apology this evening: I didn’t want to write a second post about Data Science Central, but after the huge… Continue reading The Data Science Central “Incident”
API and Market Basket Analysis
I was considering waiting until I’m done before posting about this project, but instead I thought I’d post my progress and plans while I think about the next steps. I posted earlier about using the UsesThis API to retrieve data about what other software people that use X software also use. I thought I was… Continue reading API and Market Basket Analysis
The Setup (usesthis.com) API
There’s a really interesting site usesthis.com AKA “The Setup” which interviews people and lists all of the gear that they use, including software. I found out that they have an API, (documented here) and I wanted to use my new API skills in Python to test it out! This one returns JSON unlike the NPR… Continue reading The Setup (usesthis.com) API
May 2015 #SoDS Storify
I wanted to capture the participation in the #SoDS (Summer of Data Science) hashtag somehow, so I decided to create a monthly Storify to keep up with all of your great tweets! [View the story “#SoDS May 2015” on Storify]
Monday Silliness
OK so I’m tired enough to be a little silly right now, and I think I’m finally feeling bold enough to share my parody songs inspired by these twitter exchanges: @matt_slotnick @dpatil haha i've said b4 that O'Jays "for the love of money" but w/"data data data…DATA" runs thru my head b4 projects :) —… Continue reading Monday Silliness
IPython, Requests, lxml, and the NPR API
Last week, I decided to learn how to use python to get data from an API. I started with the Codecademy “Introduction to APIs in Python” course, which got me oriented to how requests work, and in the subsequent NPR API lesson, specifically how the NPR stories API works. Certain parts of the course assumed… Continue reading IPython, Requests, lxml, and the NPR API