A baby named Al*

[This article was first published on R – scottishsnow, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. About half the males in my team seem to be called … Read moreA baby named Al*

Debugging in R: How to Easily and Efficiently Conquer Errors in Your Code

[This article was first published on INWT-Blog-RBloggers, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. When you write code, you’re sure to run into problems from time to … Read moreDebugging in R: How to Easily and Efficiently Conquer Errors in Your Code

Learning R: Data Wrangling in Password Hacking Game

Data Scientists know that about 80% of a Data Science project consists of preparing the data so that they can be analyzed. Building Machine Learning models is the fun part that only comes afterwards! This process is called Data Wrangling (or Data Munging). If you want to use some Base R data wrangling techniques in … Read moreLearning R: Data Wrangling in Password Hacking Game

Hangman game with R

[This article was first published on R – TomazTsql, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Hangman is a classic word game in which you need to … Read moreHangman game with R

An Introduction to Binomials & Inference

Inference is about drawing conclusions about a greater population via some sample of observed data. For example, you have some sample of the countries opinion on the president and you’d like to make some conclusions about the population at large. Obviously you wont be asking every single citizen, rather you will make an inference about … Read moreAn Introduction to Binomials & Inference

Customer Churn Modeling using Machine Learning with parsnip

[This article was first published on business-science.io, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. This article comes from Diego Usai, a student in Business Science University. Diego … Read moreCustomer Churn Modeling using Machine Learning with parsnip

Risk Modeling: The Power of Capital Advantage in Random-Walk Zero Sum Games (Part 1)

In this article, I will demonstrate a crucial mathematical concept that is essential to constructing risk management strategies in games of chance. Using experimentation with random number generators in R, we will explore how having a fixed capital advantage over an opponent in a model investment game will yield significant probabilistic advantage towards winning in … Read moreRisk Modeling: The Power of Capital Advantage in Random-Walk Zero Sum Games (Part 1)

Our first artist in residence: Allison Horst!

[This article was first published on RStudio Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. I’m very excited to announce that Allison Horst is RStudio’s inaugural artist-in-residence. … Read moreOur first artist in residence: Allison Horst!

Eigenvectors from Eigenvalues – a NumPy implementation

I was intrigued by the recent splashy result showing how eigenvectors can be computed from eigenvalues alone. The finding was covered in Quanta magazine and the original paper is pretty easy to understand, even for a non-mathematician. Being a non-mathematician myself, I tend to look for insights and understanding via computation, rather than strict proofs. … Read moreEigenvectors from Eigenvalues – a NumPy implementation

RcppEigen 0.3.3.7.0

[This article was first published on Thinking inside the box , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. A new minor release 0.3.3.7.0 of RcppEigen arrived on … Read moreRcppEigen 0.3.3.7.0

My R Style Guide

Notation and naming File names File names end in .R and are meaningful about their content: Good: string-algorithms.R utility-functions.R Bad: foo.R foo.Rcode stuff.R Function names Preferrably function names consist of lowercase words separated by an underscore. Using dot (.) separator is avoided as this confuses with the use of generic (S3) functions. It also prevents … Read moreMy R Style Guide

Practical Data Science with R, 2nd Edition, IS OUT!!!!!!!

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Practical Data Science with R, 2nd Edition author Dr. Nina … Read morePractical Data Science with R, 2nd Edition, IS OUT!!!!!!!

workloopR: Analysis of work loops and other data from muscle physiology experiments in R

Studies of muscle physiology often rely on closed-source, proprietary software for not only recording data but also for data wrangling and analyses. Although specialized software might be necessary to record data from highly-specialized equipment, data wrangling and analyses should be free from this constraint. It’s becoming more common for researchers to provide code along with … Read moreworkloopR: Analysis of work loops and other data from muscle physiology experiments in R

Machine Learning in R: Start with an End-to-End Test

[This article was first published on R – David’s blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. As a data scientist, you will likely be asked one … Read moreMachine Learning in R: Start with an End-to-End Test

Durban EDGE DataQuest

[This article was first published on R | datawookie, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. The Durban EDGE (Economic Development and Growth in eThekwini) DataQuest was … Read moreDurban EDGE DataQuest

Automating update of a fiscal database for the Euro Area

Our purpose is to write a program to automatically update a quarterly fiscal database for the Euro Area. The main difficulty of this exercise is to build long series that go as far as the 1980’s. We use two sources to build the database: the historical database developed in Paredes et al. (2014), which stops in … Read moreAutomating update of a fiscal database for the Euro Area

When Cross-Validation is More Powerful than Regularization

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Regularization is a way of avoiding overfit by restricting the … Read moreWhen Cross-Validation is More Powerful than Regularization

Logistic Regression in R: A Classification Technique to Predict Credit Card Default

Logistic regression is one of the statistical techniques in machine learning used to form prediction models. It is one of the most popular classification algorithms mostly used for binary classification problems (problems with two class values, however, some variants may deal with multiple classes as well). It’s used for various research and industrial problems. Therefore, … Read moreLogistic Regression in R: A Classification Technique to Predict Credit Card Default

Azure AI and Machine Learning talk series

At last week’s Microsoft Ignite conference in Orlando, our team delivered a series of 6 talks about AI and machine learning applications with Azure. The videos from each talk are linked below, and you can watch every talk from the conference online (no registration necessary). Each of our talks also comes with a companion Github … Read moreAzure AI and Machine Learning talk series

My AP Statistics Class First R Programming Assignment Using RStudio

[This article was first published on R – Saturn Science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. My AP Stats class has started their first R programming … Read moreMy AP Statistics Class First R Programming Assignment Using RStudio

RcppAnnoy 0.0.14

[This article was first published on Thinking inside the box , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. A new minor release of RcppAnnoy is now on … Read moreRcppAnnoy 0.0.14

dplyr and Oracle database with odbc on windows

[This article was first published on Guillaume Pressiat, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. RStudio makes Oracle accessibility from R easier via odbc and connections Pane.Personally, … Read moredplyr and Oracle database with odbc on windows

An API for @racently

[This article was first published on R | datawookie, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. @racently is a side project that I have been nursing along … Read moreAn API for @racently

What can we really expect to learn from a pilot study?

[This article was first published on ouR data generation, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. I am involved with a very interesting project – the NIA … Read moreWhat can we really expect to learn from a pilot study?

Community Call – Last Night, Testing Saved my Life

To the uninitiated, software testing may seem variously boring, daunting or bogged down in obscure terminology. However, it has the potential to be enormously useful for people developing software at any level of expertise, and can often be put into practice with relatively little effort. Our 1-hour Call will include two speakers and at least … Read moreCommunity Call – Last Night, Testing Saved my Life

Using R and H2O Isolation Forest For Data Quality

[This article was first published on R-Analytics, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. suppressWarnings( suppressMessages( library( h2o ) ) ) suppressWarnings( suppressMessages( library( dygraphs ) ) … Read moreUsing R and H2O Isolation Forest For Data Quality

A comparison of methods for predicting clothing classes using the Fashion MNIST dataset in RStudio and Python (Part 1)

[This article was first published on R Views, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Florianne Verkroost is a PhD candidate at Nuffield College at the University … Read moreA comparison of methods for predicting clothing classes using the Fashion MNIST dataset in RStudio and Python (Part 1)

Statistical uncertainty with R and pdqr

General description Statistical estimation usually has the following setup. There is a sample (observed, usually randomly chosen, set of values of measurable quantities) from some general population (whole set of values of the same measurable quantities). We need to make conclusions about the general population based on a sample. This is done by computing summary … Read moreStatistical uncertainty with R and pdqr

Cleaning the Table

[This article was first published on R on kieranhealy.org, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. While I’m talking about getting data into R this weekend, here’s … Read moreCleaning the Table

Dangerous streets of Bratislava! Animated maps using open data in R

[This article was first published on Peter Laurinec, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. At the work recently, I wanted to make some interesting start-up pitch … Read moreDangerous streets of Bratislava! Animated maps using open data in R

future 1.15.0 – Lazy Futures are Now Launched if Queried

[This article was first published on JottR on R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. No dogs were harmed while making this release future 1.15.0 is … Read morefuture 1.15.0 – Lazy Futures are Now Launched if Queried

Reading in Data

[This article was first published on R on kieranhealy.org, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Here’s a common situation: you have a folder full of similarly-formatted … Read moreReading in Data

Using Spark from R for performance with arbitrary code – Part 4 – Using the lower-level invoke API to manipulate Spark’s Java objects from R

[This article was first published on Jozef’s Rblog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. In the previous parts of this series, we have shown how to … Read moreUsing Spark from R for performance with arbitrary code – Part 4 – Using the lower-level invoke API to manipulate Spark’s Java objects from R

Learning Linux – the wrong way – day 2

[This article was first published on HighlandR, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Unborking the borked laptop – Recap I’m trying to learn some Linux. Ostensibly … Read moreLearning Linux – the wrong way – day 2