Data Science Austria

Python vs SQL: Comparison for Data Pipelines

Breaking into the workforce as a web developer, my first interaction with databases and SQL was using an Object Relational Model (ORM). I was using the Django query sets API and had an excellent experience using the interface. Thereon-after, I changed to a data engineering role and became much more … Read morePython vs SQL: Comparison for Data Pipelines

How to Install OpenAI Gym in a Windows Environment

A step by step guide for getting OpenAI Gym up and running This is the second in a series of articles about reinforcement learning and OpenAI Gym. The first part can be found here. Introduction OpenAI Gym is an awesome tool which makes it possible for computer scientists, both amateur and … Read moreHow to Install OpenAI Gym in a Windows Environment

Image Segmentation using Python’s scikit-image module

Supervised segmentation Thresholding is a very basic segmentation process and will not work properly in a high-contrast image for which we will be needing more advanced tools. For this section, we will use an example image which is freely available and attempt to segment the head portion using supervised segmentation … Read moreImage Segmentation using Python’s scikit-image module

Explain Python classes and objects to my nephew (+advanced use)

It is common secret that Python programming language has a solid claim to being the fastest-growing major programming language witnessing an extraordinary growth in the last five years, as seen by Stack Overflow traffic. Based on data describing the Stack Overflow question views which go to late 2011, the growth … Read moreExplain Python classes and objects to my nephew (+advanced use)

10 Python File System Methods You Should Know

In this guide we’ll look at methods from the os and shutil modules. The os module is the primary Python module for interacting with the operating system. The shutil module also contains high-level file operations. For some reason you make directories with os but move and copy them with shutil. … Read more10 Python File System Methods You Should Know

Basketball Analytics: Predicting Win Shares

Analysis Objective: Can we predict individual win shares of NBA players using other basketball metrics? The data used for this analysis is from the 2016–17 and 2017–2018 NBA Season, using Basketball-Reference. Essentially, I used data from the 2016–2017 NBA season to create our model and stats from the most recent … Read moreBasketball Analytics: Predicting Win Shares

Backpropagation for people who are afraid of math

Backpropagation is one of the most important concepts in machine learning. There are many online resources that explain the intuition behind this algorithm (IMO the best of these is the backpropagation lecture in the Stanford cs231n video lectures. Another very good source, is this), but getting from the intuition to … Read moreBackpropagation for people who are afraid of math

How to Automatically Import Your Favorite Libraries into IPython or a Jupyter Notebook

No more typing “import pandas as pd” 10 times a day If you often use interactive IPython sessions or Jupyter Notebooks and you’re getting tired of importing the same libraries over and over, try this: Navigate to ~/.ipython/profile_default Create a folder called startup if it’s not already there Add a new … Read moreHow to Automatically Import Your Favorite Libraries into IPython or a Jupyter Notebook

Performing multidimensional matrix operations using numpy’s broadcasting

Numpy’s broadcasting feature can be somewhat confusing for new users of this library, but as it allows for very clean, elegant and FUN coding. It is definitely worth the effort of getting used to. In this short article, I wanted to show a nice implementation of broadcasting to save some … Read morePerforming multidimensional matrix operations using numpy’s broadcasting