From one Regression to Hundreds Within Seconds: A Shiny Specification Curve

Online appendices detailing the robustness of empirical analyses areparamount but they never let readers explore all reasonable researcherdegrees of freedom. Simonsohn, Simmons and Nelson suggesta ‘specification curve’ that allows readers to eyeball how a main coefficientof interest varies across a wide arrange of specifications. I build on this ideaby making it interactive: A shiny-based web … Read more From one Regression to Hundreds Within Seconds: A Shiny Specification Curve

celebRation 2020

The year 2020 marks the 20th anniversary ofthe release of R version 1.0.0! To celebrate this, we are inviting thecommunity of R users and developers for a two-day celebRationworkshop/mini-conference on February 28-29th 2020 in Copenhagen. We kick off on 28th February with hands-on workshops on two hottopics, namely data visualization using contemporaryggplot2and extending R withC++ … Read more celebRation 2020

Practical Data Science with R 2nd Edition now in-stock at Amazon.com!

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Practical Data Science with R 2nd Edition is now in-stock … Read more Practical Data Science with R 2nd Edition now in-stock at Amazon.com!

How to make a precision recall curve in R

Precision recall (PR) curves are useful for machine learning model evaluation when there is an extreme imbalance in the data and the analyst is interested particuarly in one class. A good example is credit card fraud, where the instances of fraud are extremely few compared with non fraud. Here are some facts about PR curves. … Read more How to make a precision recall curve in R

parcats 0.0.1 released

[This article was first published on R on datistics, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. parcats was released on CRAN. It is an htmlwidget providing bindings … Read more parcats 0.0.1 released

R Shiny for beginners: annotated starter code

This week I decided to get started with the R shiny package for interactive web applications. As an absolute beginner, I want to document my learning journey in the hope that it will be useful for other first-time shiny users. This post assumes some basic familiarity with R and the tidyverse, but no prior knowledge … Read more R Shiny for beginners: annotated starter code

How to reorder arrange bars with in each Facet of ggplot

One of the problems that we usually face with ggplot is that rearranging the bars in ascending or descending order. If that problem is solved using reorder() or fct_reorder(), the next problem is when we have facets and ordering bars within each facet. Recently I came acrosss this function reorder_within() from the package tidytext (Thanks … Read more How to reorder arrange bars with in each Facet of ggplot

Job Posting: Research Software Engineer at EcoHealth Alliance

The position will be based at EcoHealth Alliance in New York City. EcoHealth Alliance is an equal opportunity employer and we encourage people with a diverse range of backgrounds to apply. We offer comprehensive benefit package including health, dental, and vision coverage, and a 403(b) pension plan. How to Apply: Send an email with a … Read more Job Posting: Research Software Engineer at EcoHealth Alliance

In-Database Logisitc Regression with R

Roland Stevenson is a data scientist and consultant who may be reached on Linkedin. In a previous article we illustrated how to calculate xgboost model predictions in-database. This was referenced and incorporated into tidypredict. After learning more about what the tidypredict team is up to, I discovered another tidyverse package called modeldb that fits models … Read more In-Database Logisitc Regression with R

Advent of Code 2019-04 with R & JavaScript

[This article was first published on Colin Fay, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Solving Advent of Code 2019-04 with R and JavaScript. [Disclaimer] Obviously, this … Read more Advent of Code 2019-04 with R & JavaScript

How Auth0’s Data Team uses R and Python

The Data team is responsible for crunching, reporting, and serving data. The team also does data integrations with other systems, creating machine learning, and deep learning models. With this post, we intend to share our favorite tools, which are proven to run with thousands of millions of data.Scaling processes in real-world scenarios is a hot … Read more How Auth0’s Data Team uses R and Python

RStudio’s Commercial Desktop License is now RStudio Desktop Pro

[This article was first published on RStudio Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. We have good news for our commercial desktop IDE customers. We are … Read more RStudio’s Commercial Desktop License is now RStudio Desktop Pro

Introducing Mark Padgham, rOpenSci’s new Software Research Scientist

We’re thrilled to be introducing a new member of our team. Mark Padgham has joined rOpenSci as a Software Research Scientist working full-time from Münster, Germany. Mark will play a key role in research and development of statistical software standards and expanding our efforts in software peer review, enabled by new funding from the Sloan … Read more Introducing Mark Padgham, rOpenSci’s new Software Research Scientist

Advent of Code 2019-03 with R & JavaScript

[This article was first published on Colin Fay, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Solving Advent of Code 2019-03 with R and JavaScript. [Disclaimer] Obviously, this … Read more Advent of Code 2019-03 with R & JavaScript

Improve General Regression Neural Network by Monotonic Binning

A major criticism on the binning algorithm as well as on the WoE transformation is that the use of binned predictors will decrease the model predictive power due to the loss of data granularity after the WoE transformation. While talk is cheap, I would use the example below to show that using the monotonic binning … Read more Improve General Regression Neural Network by Monotonic Binning

learnr 0.10.0

[This article was first published on RStudio Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. learnr 0.10.0 has been released! In this version of learnr, quiz questions … Read more learnr 0.10.0

Advent of Code 2019-02 with R & JavaScript

[This article was first published on Colin Fay, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Solving Advent of Code 2019-02 with R and JavaScript. [Disclaimer] Obviously, this … Read more Advent of Code 2019-02 with R & JavaScript

riddle by attrition

[This article was first published on R – Xi’an’s Og, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. The weekend riddle from The Riddler is rather straightforward [my … Read more riddle by attrition

Advent of Code 2019-01 with R & JavaScript

[This article was first published on Colin Fay, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Solving Advent of Code 2019-01 with R and JavaScript. [Disclaimer] Obviously, this … Read more Advent of Code 2019-01 with R & JavaScript

How You Measure Months Matters — A Lot. A Look At Two Implementations of KDA

This post will detail a rather important finding I found while implementing a generalized framework for momentum asset allocation backtests. Namely, that when computing momentum (and other financial measures for use in asset allocation, such as volatility and correlations), measuring formal months, from start to end, has a large effect on strategy performance. So, first … Read more How You Measure Months Matters — A Lot. A Look At Two Implementations of KDA

GRNN with Small Samples

After a bank launches a new product or acquires a new portfolio, the risk modeling team would often be faced with a challenge of how to estimate the corresponding performance, e.g. risk or loss, with a limited number of data points conditional on business drivers or macro-economic indicators. For instance, it is required to project … Read more GRNN with Small Samples

Updates on RSpectra: new “center” and “scale” parameters for svds()

[This article was first published on R on Yixuan’s Homepage, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Per the suggestion by @robmaz, RSpectra::svds() now has two new … Read more Updates on RSpectra: new “center” and “scale” parameters for svds()

Practical Data Science with R, 2nd Edition: Introduction Video

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Nina and I have prepared a quick introduction video for … Read more Practical Data Science with R, 2nd Edition: Introduction Video

Version control – The uncomplicated work on a common project

Whoever starts a job as a developer in 2019, be it in software development or in data science, data ops, etc., is usually confronted with a tool for version management relatively early. Programs such as Git, SVN and BitKeeper are primarily used to transparently wind forwards and rewind the development history or to develop new … Read more Version control – The uncomplicated work on a common project

Visualizing the Premier League So Far, Part 2: Stats from Open Play and Set Pieces

[This article was first published on R by R(yo), and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. In Part 2, I will be looking into various statistics from … Read more Visualizing the Premier League So Far, Part 2: Stats from Open Play and Set Pieces