Data Science Austria

Exploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

Well, 2018 has flown by and today seems like an appropriate time to take a look at the landscape of R bloggerdom as seen through the eyes of readers of R-bloggers and R Weekly. We’ll do this via a new package designed to make it easier to treat Feedly as … Read moreExploring 2018 R-bloggers & R Weekly Posts with Feedly & the ‘seymour’ package

An NLP View on Holiday Movies — Part II: Text Generation using LSTM’s in Keras

Dec 30, 2018 Photo by rawpixel on Unsplash Continuing on the first part of this blog post, let’s see if we can train an RNN with the input sequences, and use that to generate some new ones. The code for this part can be found here. The full code and the … Read moreAn NLP View on Holiday Movies — Part II: Text Generation using LSTM’s in Keras

An NLP View on Holiday Movies — Part I: Topic Modeling using Gensim and SKlearn

Dec 30, 2018 Photo by Tom Coomer on Unsplash Holidays are a time for family, friends, snow and as far as my wife is concerned: corny holiday movies. To try and help her never-ending Holiday movie appetite, we’re going to check if we can create a new Christmas movie. The blog … Read moreAn NLP View on Holiday Movies — Part I: Topic Modeling using Gensim and SKlearn

Build Log Analytics Application using Apache Spark

Step by step process of developing a real world application using Apache Spark, along with main focus on explaining the architecture of Spark. Image Source: techgyo.com Why Apache Spark Architecture if we have Hadoop? The Hadoop Distributed File System (HDFS), which stores files in a Hadoop-native format and parallelizes them … Read moreBuild Log Analytics Application using Apache Spark

Activation Functions in Neural Networks

The motive, use cases, advantages and limitations tl;dr The post discusses the various linear and non-linear activation functions used in deep learning and neural networks. We also take a look into how each function performs in different situations, the advantages and disadvantages of each then finally concluding with one last … Read moreActivation Functions in Neural Networks

Leaf Plant Classification: Statistical Learning Model – Part 2

Categories Advanced Modeling Tags Linear Regression Principal Component Analysis R Programming In this post, I am going to build a statistical learning model as based upon plant leaf datasets introduced in part one of this tutorial. We have available three datasets, each one providing sixteen samples each of one-hundred plant … Read moreLeaf Plant Classification: Statistical Learning Model – Part 2