Organize Why R? 2019 pre-meeting in your city

[This article was first published on http://r-addict.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Why R? pre-meetings are R meetups that support local R groups. They promote … Read moreOrganize Why R? 2019 pre-meeting in your city

‘mRpostman’ – IMAP Tools for R in a Tidy Way

mRpostman is an R package to help you to easy connect to your IMAP (Internet Message Access Protocol) server and execute commands, such as listing mailboxes, searching and fetching messages in a tidy way. It calls ‘curl’ in background when issuing the IMAP commands (all credit to Jeroen Ooms and Daniel Stenberg). So far, I … Read more‘mRpostman’ – IMAP Tools for R in a Tidy Way

Quick Hit: A new 64-bit Swift 5 RSwitch App

At the bottom of the R for macOS Developer’s Page there’s mention of an “other binary” called “RSwitch” that is “a small GUI that allows you to switch between R versions quickly (if you have multiple versions of R framework installed).” Said switching requires you to use the “tar.gz” versions of R from the R … Read moreQuick Hit: A new 64-bit Swift 5 RSwitch App

So baut man Analytikplattformen – Teil 2: Intelligentes Benutzer- und Rollenkonzept

What does a modern analytics platform need to offer companies real added value? Why is the administration of user and role rights a factor not to be underestimated when using analytics platforms? In the previous article, we showed how important an intuitive user interface and an open user group concept are for the company-wide use … Read moreSo baut man Analytikplattformen – Teil 2: Intelligentes Benutzer- und Rollenkonzept

How to do Topic Extraction from Customer Reviews in R

Topic Extraction is an integral part of IE (Information Extraction) from Corpus of Text to understand what are all the key things the corpus is talking about. While this can be achieved naively using unigrams and bigrams, a more intelligent way of doing it with an algorithm called RAKE is what we’re going to see … Read moreHow to do Topic Extraction from Customer Reviews in R

EARL London – speaker interview, Johannes Tang Kristensen

We sent Johannes Tang Kristensen from Arla Foods a few questions about his upcoming talk at EARL London – ‘How much milk do our cows produce? Lessons learned from putting our first R model into production’. How did the need for your project come about? The project started out as part of a larger initiative … Read moreEARL London – speaker interview, Johannes Tang Kristensen

Which Factors Influence Gas Prices? Do Gas Companies Narratives Hold True?

Like the data hunter he is, my STATWORX colleague Jakob came across a rich data source regarding gas station prices. While his focus has been on checking very common myths about gas prices (check out his blogpost!), he did a fantastic job at cleaning and preparing the raw data to get it in a usable … Read moreWhich Factors Influence Gas Prices? Do Gas Companies Narratives Hold True?

Why R? 2019 – Agenda Released + Regular Registration Ends Aug 31st!

[This article was first published on http://r-addict.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. A month ago we closed Call for Papers for Why R? 2019 Conference. … Read moreWhy R? 2019 – Agenda Released + Regular Registration Ends Aug 31st!

Visualizing Soccer with StatsBomb Data and R, Part 1: Simple xG and Pass Partner Plots!

[This article was first published on R by R(yo), and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. This will be Part 1 of what I hope to be … Read moreVisualizing Soccer with StatsBomb Data and R, Part 1: Simple xG and Pass Partner Plots!

Notes on Becoming an RStudio Certified Trainer

[This article was first published on R – AriLamstein.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. I recently became an RStudio Certified Trainer, and thought that it … Read moreNotes on Becoming an RStudio Certified Trainer

RcppQuantuccia 0.0.3

[This article was first published on Thinking inside the box , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. A maintenance release of RcppQuantuccia arrived on CRAN earlier … Read moreRcppQuantuccia 0.0.3

Introducing Open Forensic Science in R

The free online book Open Forensic Science in R was created to foster open science practices in the forensic science community. It is comprised of eight chapters: an introduction and seven chapters covering different areas of forensic science: the validation of DNA interpretation systems, firearms analysis of bullets and casings, latent fingerprints, shoe outsole impressions, … Read moreIntroducing Open Forensic Science in R

simstudy updated to version 0.1.14: implementing Markov chains

I’m developing study simulations that require me to generate a sequence of health status for a collection of individuals. In these simulations, individuals gradually grow sicker over time, though sometimes they recover slightly. To facilitate this, I am using a stochastic Markov process, where the probability of a health status at a particular time depends … Read moresimstudy updated to version 0.1.14: implementing Markov chains

Fitting ‘complex’ mixed models with ‘nlme’. Example #1

Fitting mixed models has become very common in biology and recent developments involve the manipulation of the variance-covariance matrix for random effects and residuals. To the best of my knowledge, within the frame of frequentist methods, the only freeware solution in R should be based on the ‘nlme’ package, as the ‘lmer’ package does not … Read moreFitting ‘complex’ mixed models with ‘nlme’. Example #1

Referring to POTUS on Twitter: a stance-based perspective on variation in the 116th House

[This article was first published on Jason Timm, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. In this post, we investigate how (& how often) members of the … Read moreReferring to POTUS on Twitter: a stance-based perspective on variation in the 116th House

Correspondence Analysis visualization using ggplot

[This article was first published on Rcrastinate, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. What we want to do Recently, I used a correspondence analysis from the … Read moreCorrespondence Analysis visualization using ggplot

What does a modern analytics platform need to offer companies real added value?

What does a modern analytics platform need to offer companies real added value? Currently, new, innovative platforms are sprouting up on the market again and again – implemented with technical competence and ideally suited to the respective analytical approaches. But the question arises: Is that enough? Is it enough to develop software that allows reliable … Read moreWhat does a modern analytics platform need to offer companies real added value?

Regular Sequences

[This article was first published on R-exercises, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. So far in this series, we used vectors from built-in datasets (rivers, women … Read moreRegular Sequences

Why Machine Learning is more Practical than Econometrics in the Real World

[This article was first published on R – Remix Institute, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Motivation I’ve read several studies and articles that claim Econometric … Read moreWhy Machine Learning is more Practical than Econometrics in the Real World

Missing Values In Dataframes With Inspectdf

[This article was first published on Alastair Rushworth, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Summarising NA by column in dataframes Exploring the number of records containing … Read moreMissing Values In Dataframes With Inspectdf

Modern R with the tidyverse is available on Leanpub

Yesterday I released an ebook on Leanpub,called Modern R with the tidyverse, which you can alsoread for free here. In this blog post, I want to give some context. Modern R with the tidyverse is the second ebook I release on Leanpub. I released the first one, calledFunctional programming and unit testing for data munging … Read moreModern R with the tidyverse is available on Leanpub

Tech Dividends, Part 2

[This article was first published on R Views, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. In a previous post, we explored the dividend history of stocks included … Read moreTech Dividends, Part 2

3 tidyverse tricks for most commonly used Excel Features

In this post, We’re simply going to see 5 tricks that could help improve your tooling using {tidyverse}. Create a difference variable between the current value and the next value This is also known as lead and lag – especially in a time series dataset this varaible becomes very important in feature engineering. In Excel, … Read more3 tidyverse tricks for most commonly used Excel Features

EARL London – speaker interview

Robert Duff (Transport for London) and Rahulan Chandrasekaran (Department for Transport) Robert and Rahulan are doing a joint presentation titled ‘Let me in! Let me on! Quantifying highly frustrating events on the Underground’ on 11 September at EARL London. We dropped Robert an email to find out more around the subject of his and Rahulan’s talk. … Read moreEARL London – speaker interview

What the MAPE is FALSELY blamed for, its TRUE weaknesses and BETTER alternatives!

A major problem arises when comparing forecasting methods and models across different time series. This is a challenge we regularly face at STATWORX. Unit dependent measures like the MAE (Mean Absolute Error) and the RMSE (Root Mean Squared Error) turn out to be unsuitable and hardly helpful if the time series is measured in different … Read moreWhat the MAPE is FALSELY blamed for, its TRUE weaknesses and BETTER alternatives!

Insurance data science : Networks

At the Summer School of the Swiss Association of Actuaries, in Lausanne, I will start talking about networks and insurance this Friday. Slides are available online Related If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook… Favorite

Fun with progress bars: Fish, daggers and the Star Wars trench run

If you’re like me, when running a process through a loop you’ll add in counters and progress indicators. That way you’ll know if it will take 5 minutes or much longer. It’s also good for debugging to know when the code wigged-out. This is typically what’s done. You take a time stamp at the start … Read moreFun with progress bars: Fish, daggers and the Star Wars trench run

Big Data: Wrangling 4.6M Rows with dtplyr (the NEW data.table backend for dplyr)

[This article was first published on business-science.io, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Wrangling Big Data is one of the best features of the R programming … Read moreBig Data: Wrangling 4.6M Rows with dtplyr (the NEW data.table backend for dplyr)

2 Months in 2 Minutes – rOpenSci News, August 2019

rOpenSci HQ Software Peer Review 5 community-contributed packages passed software peer review. gitignore – Create useful .gitignore files for your project. Author: Philippe Massicotte; Reviewers: Amanda Dobbyn, Mauro Lepore; Review grainchanger – Moving-Window and Direct Data Aggregation. Author: Laura Jane Graham; Reviewers: Max Joseph, John Baumgartner; Review popler – Browse and query the popler database. … Read more2 Months in 2 Minutes – rOpenSci News, August 2019

Prepping data for #rstats #tidyverse and a priori planning

messy data can be your friend (or frenemy) Many if not most data clean up, tidying, wrangling, and joining can be done directly in R. There are many advantages to this approach – i.e. read in data in whatever format (from excel to json to zip) and then do your tidying – including transparency, a … Read morePrepping data for #rstats #tidyverse and a priori planning

Using the lpSolve package in R to optimise an electricity system

Reducing carbon emissions is maybe the world’s most pressing challenge at the moment. One obvious avenue for action is the reduction of carbon emissions from electricity generation, which are a significant contributor to global carbon emissions overall. This is particularly true if trends now in place continue undisturbed, with the world relying on electricity to … Read moreUsing the lpSolve package in R to optimise an electricity system

What is vtreat?

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. vtreat is a DataFrame processor/conditioner that prepares real-world data for … Read moreWhat is vtreat?