Building a Shiny app to show the impact of vaccines

Debates about vaccines areongoing inmanycountries and the debate hasreblossomed in Denmark after we’ve had five recent occurrences ofmeasels. While that is nothing compared to the measles outbreakcurrently ravagingJapanit is still enough to worry the health authorities that it mightresult in an epidemic. Here we’ll use Shiny to create an app that shows theimpact of contagious … Read more

Categories R Tags ExcerptFavorite

Getting started with RMarkdown & trying to make it in the world of Kaggle. Join MünsteR for our next meetup!

In our next MünsteR R-user group meetup on Tuesday, April 9th, 2019, we will have two exciting talks: Getting started with RMarkdown and Trying to make it in the world of Kaggle! You can RSVP here: http://meetu.ps/e/Gg5th/w54bW/f Getting started with RMarkdown First, Niklas Wulms from the University Hospital, Münster will give an introduction to RMarkdown: … Read more

Categories R Tags ExcerptFavorite

Graph analysis using the tidyverse

It is because I am not a graph analysis expert that I though it important to write this article. For someone who thinks in terms of single rectangular data sets, it is a bit of a mental leap to understand how to apply tidy principles to a more robust object, such as a graph table. … Read more

Categories R Tags ExcerptFavorite

Cancer clusters and the Poisson distributions

On March 1, 2019, an article was published in Israel’s Ynetnews website, under the title “The curious case of the concentration of cancer”. The story reports on a concentration of cancer cases in the town of Rosh Ha’ayin in central Israel. In the past few years dozens of cases of cancer have been discovered in … Read more

Categories R Tags ExcerptFavorite

How to become a Mango

At Mango, we talk a lot about going on a ‘data-driven journey’ with your business. We’re passionate about data and getting the best use out of it. But for now, instead of looking at business journeys, I wanted to talk to the Mango team and find out how they started on their own ‘data journey’ – what … Read more

Categories R Tags ExcerptFavorite

dqrng v0.1.0: breaking changes

A new version of dqrng has made it onto the CRAN servers. This version brings two breaking changes, hence the “larger than usual” change in version number: An integer vector instead of a single int is used for seeding (Aaron Lun in #10) Single integer seeds lead to a different RNG state than before. dqrng::dqset_seed() … Read more

Categories R Tags ExcerptFavorite

Evolution works!

Source: Wikimedia Hamlet: Do you see yonder cloud that’s almost in shape of a camel?Polonius: By the mass, and ’tis like a camel, indeed.Hamlet: Methinks it is like a weasel.from Hamlet by William Shakespeare The best way to see how evolution works, is to watch it in action! You can watch the evolution of cars … Read more

Categories R Tags ExcerptFavorite

shapper is on CRAN, it’s an R wrapper over SHAP explainer for black-box models

Written by: Alicja Gosiewska In applied machine learning, there are opinions that we need to choose between interpretability and accuracy. However in field of the Interpretable Machine Learning, there are more and more new ideas for explaining black-box models. One of the best known method for local explanations is SHapley Additive exPlanations (SHAP). The SHAP … Read more

Categories R Tags ExcerptFavorite

Getting Help in R

Introduction In this post, we will learn about the different methods of getting help in R.Often, we get stuck while doing some analysis as either we do not know thecorrect function to use or its syntax. It is important for anyone who is newto R to know the right place to look for help. There … Read more

Categories R Tags ExcerptFavorite

Building Big Shiny Apps — A Workflow 2/2

Second part of the blog transcription of the talk I’ve given during the eposter session of the rstudio::conf(2019). Read the first part here. Building Big Shiny Apps: step by step Step 1: Designing Don’t rush into coding. I know you want to, because it’s what we like to do and what we are good at. … Read more

Categories R Tags ExcerptFavorite

Getting Your First Data Science Job

Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Chris Albon, Data Scientist at Devoted Health. Here is the podcast link. Hugo: Hi there, Chris, and welcome to DataFramed. Chris: Hey, how’s it going? Hugo: It’s great man. How are you? Chris: I’m good. This is like one of the first podcast I’ve … Read more

Categories R Tags ExcerptFavorite

The fastest cyclists of Europe live in …

The fastest cyclists of Europe live in … Analyzing STRAVA data to find out which city has the faster cyclists with R and R-shiny. My contribution to the shiny contest. A shiny app visualizing STRAVA segments in London and Paris and seeing where people are faster STRAVA is one of the most popular fitness tracking apps. It not … Read more

Categories R Tags ExcerptFavorite

Using parameters in Rmarkdown

Nothing new or original here, just something that I learned about quite recently that may be useful for others. One of my more “popular” code repositories, judging by Twitter, is – well, Twitter. It mostly contains Rmarkdown reports which summarise meetings and conferences by analysing usage of their associated Twitter hashtags. The reports follow a … Read more

Categories R Tags ExcerptFavorite

2019-01 A Geometry Engine Interface for ‘grid’

This report describes a new function in ‘grid’ called grobCoords and a new package called ‘gridGeometry’ that combines grobCoords with the ‘polyclip’ package to provide a geometry engine interface for ‘grid’. Paul Murrell Download Related To leave a comment for the author, please follow the link and comment on their blog: R – Stat Tech. … Read more

Categories R Tags ExcerptFavorite

Polished human cognitive characteristics chapter

It has been just over two years since I release the first draft of the Human cognitive characteristics chapter of my evidence-based software engineering book. As new material was discovered, it got added where it seemed to belong (at the time), no effort was invested in maintaining any degree of coherence. The plan was to … Read more

Categories R Tags ExcerptFavorite

CRAN Mirror “Security”

In the “Changes on CRAN” section of the latest version of the The R Journal (Vol. 10/2, December 2018) had this short blurb entitled “CRAN mirror security”: Currently, there are 100 official CRAN mirrors, 68 of which provide both secure downloads via ‘https’ and use secure mirroring from the CRAN master (via rsync through ssh … Read more

Categories R Tags ExcerptFavorite

Classification of historical newspapers content: a tutorial combining R, bash and Vowpal Wabbit

Can I get enough of historical newspapers data? Seems like I don’t. I already wrote four(1,2,3 and4) blog posts, butthere’s still a lot to explore. This blog post uses a new batch of data announced on twitter: For all who love to analyse text, the BnL released half a million of processed newspaper articles. Historical … Read more

Categories R Tags ExcerptFavorite

Efficient MCMC with Caching

This post is part of a running series on Bayesian MCMC tutorials. For updates, follow @StableMarkets. Metropolis Review Metropolis-Hastings is an MCMC algorithm for drawing samples from a distribution known up to a constant of proportionality, . Very briefly, the algorithm works by starting with some initial draw then running … Continue reading Efficient MCMC with Caching → … Read more

Categories R Tags ExcerptFavorite

Using the R Package Profvis on a Linear Model

Not all data scientists were computer scientists who discovered their exceptional data literacy skills. They come from all walks of life, and sometimes that can mean optimizing for data structures and performance isn’t the top priority. That’s perfectly fine! There may come a time where you find yourself executing a chunk of code and consciously … Read more

Categories R Tags ExcerptFavorite

rquery Substitution

The rquery R package has several places where the user can ask for what they have typed in to be substituted for a name or value stored in a variable. This becomes important as many of the rquery commands capture column names from un-executed code. So knowing if something is treated as a symbol/name (which … Read more

Categories R Tags ExcerptFavorite

Creating blazing fast pivot tables from R with data.table – now with subtotals using grouping sets

Data manipulation and aggregation is one of the classic tasks anyone working with data will come across. We of course can perform data transformation and aggregation with base R, but when speed and memory efficiency come into play, data.table is my package of choice. In this post we will look at of the fresh and … Read more

Categories R Tags ExcerptFavorite

Visualizing Bike Share Data (NiceRide)

This tutorial will cover exploring and visualizing data through 2018 for the Minneapolis, MN bike sharing service NiceRide. Part of what makes R incredible is the number of great packages. Part of what makes packages like ggmap and gganimate great is how they build on existing packages. First step, as always, is to include the … Read more

Categories R Tags ExcerptFavorite

My Shiny Dashboard, Milwaukee Beer

It’s been a dream of mine to break into the data science field, so prior to my move, I decided to add another project to my portfolio – a sleek Shiny dashboard. A brutal truth about this project was that I had to invest time in finding my own data and deciding what to do … Read more

Categories R Tags ExcerptFavorite

The delta method and its implementation in R

Suppose that you have a sample of a variable of interest, e.g. the heights of men in certain population, and for some obscured reason you are interest not in the mean height μ but in its square μ². How would you inference on μ², e.g. test a hypothesis or calculate a confidnce interval? The delta … Read more

Categories R Tags ExcerptFavorite

Powerball demystified

The US Powerball lottery hysteria took another step when no one won the big jackpot in the last draw that took place on October 20, 2018. So, the total jackpot is now 2.22 billion dollars. I am sure that you want to win this jackpot. I myself want to win it. Actually, there are two different … Read more

Categories R Tags ExcerptFavorite

R Journal publication

The R Journal is the open access, refereed journal of the R project for statistical computing. It features short to medium length articles covering topics that should be of interest to users or developers of R. Christoph Weiss, Gernot Roetzer and myself have joined forces to write an R package and the accompanied paper: Forecast … Read more

Categories R Tags ExcerptFavorite

A brief history of clinical trials

The earliest report of a clinical trial is probably provided in the Book of Daniel. Daniel and a group of other Jewish people who stayed at the palace of the king of Babylon, did not want to eat the king’s non-Kosher food and preferred a vegetarian diet. To show that vegetarian and Kosher diet is healthier, … Read more

Categories R Tags ExcerptFavorite

What is logistic in the logistic regression?

Suppose that you are interviewed for a data scientist role. You are asked about logistic regression, and you answer all sorts of questions: How to run it in Python, how would you perform feature selection, and how would you use it for prediction. For the last question you answer that if you have the estimated of the regression … Read more

Categories R Tags ExcerptFavorite

Binning Data in a Database

Roz King just wrote an interesting article on binning data (a common data analytics step) in a database. He compares a case-based approach (where the bin divisions are stuffed into code) with a join based approach. He shares code and timings. Best of all: rquery gets some attention and turns out to be the dominant … Read more

Categories R Tags ExcerptFavorite

Some R Packages for ROC Curves

In a recent post, I presented some of the theory underlying ROC curves, and outlined the history leading up to their present popularity for characterizing the performance of machine learning models. In this post, I describe how to search CRAN for packages to plot ROC curves, and highlight six useful packages. Although I began with … Read more

Categories R Tags ExcerptFavorite

htmlunitjars Updated to 2.34.0

The in-dev htmlunit package for javascript-“enabled” web-scraping without the need for Selenium, Splash or headless Chrome relies on the HtmlUnit library and said library just released version 2.34.0 with a wide array of changes that should make it possible to scrape more gnarly javascript-“enabled” sites. The Chrome emulation is now also on-par with Chrome 72 … Read more

Categories R Tags ExcerptFavorite

EARL London early bird tickets now on sale

Early bird tickets for the Enterprise Applications of the R Language Conference are now on sale! The EARL Conference is in its sixth year, its a cross-sector conference that focuses on the commercial use of the R programming language. Take a look at our highlights from last year: [embedded content] We are busy putting together … Read more

Categories R Tags ExcerptFavorite

drat All The ?! : Enabling Easier Package Discovery and Installation with Your Own CRAN-like Repo for Your Packages

I’ve got a work-in-progress drat-ified CRAN-like repo for (eventually) all my packages over at CINC (“CINC is not CRAN” and it also sounds like “sync”). This is in parallel with a co-location/migration of all my packages to SourceHut (just waiting for the sr.ht alpha API to be baked) and a self-hosted public Gitea instance. Everything … Read more

Categories R Tags ExcerptFavorite

RStudio Instructor Training

We are pleased to announce the launch of RStudio’s instructor training and certification program. Its goal is to help people apply modern evidence-based teaching practices to teach data science using R and RStudio’s products, and to help people who need such training find the trainers they need. Like the training programs for flight instructors, the … Read more

Categories R Tags ExcerptFavorite

A wee look at group_map and group_split in dplyr

Dplyr 0.8.0 launched recently, which you probably already know, but just in case you missed it.. Two new functions have been catching my eye : group_map and group_split. The aim of this post – take a first look at these and try and get a new blog post up on github before February is out. … Read more

Categories R Tags ExcerptFavorite

CDSBMexico: remember to apply for BioC2019 travel scholarships

This blog post was first published at the CDSBMexico website. #CDSBMexico: remember to apply for BioC2019 travel scholarships!! Due date is March 15thhttps://t.co/iegG0qQzwu Let us help you! Here we give you some ideas ?We can also give you feedback via Slack ✅#rstats #bioconductor @Bioconductor #bioc2019 #diversity #LatAm #rstatsES pic.twitter.com/EORg8d2Qxj — ComunidadBioInfo (@CDSBMexico) March 1, 2019 … Read more

Categories R Tags ExcerptFavorite

KDA–Robustness Results

This post will display some robustness results for KDA asset allocation. Ultimately, the two canary instruments fare much better using the original filter weights in Defensive Asset Allocation than in other variants of the weights for the filter. While this isn’t as worrying (the filter most likely was created that way and paired with those … Read more

Categories R Tags ExcerptFavorite

Robust Regressions: Dealing with Outliers in R

Categories Regression Models Tags Machine Learning Outlier R Programming Video Tutorials It is often the case that a dataset contains significant outliers – or observations that are significantly out of range from the majority of other observations in our dataset. Let us see how we can use robust regressions to deal with this issue. I … Read more

Categories R Tags ExcerptFavorite

handlr: convert among citation formats

Citations are a crucial piece of scholarly work. They hold metadata on each scholarly work, including what people were involved, what year the work was published, where it was published, and more. The links between citations facilitate insight into many questions about scholarly work. Citations come in many different formats including BibTex, RIS, JATS, and … Read more

Categories R Tags ExcerptFavorite

“If You Were an R Function, What Function Would You Be?”

We’ve been getting some good uptake on our piping in R article announcement. The article is necessarily a bit technical. But one of its key points comes from the observation that piping into names is a special opportunity to give general objects the following personality quiz: “If you were an R function, what function would … Read more

Categories R Tags ExcerptFavorite

R-Trainings in Hamburg – Register now!

With more than 1,500 satisfied participants, eodas R-trainings are the leading courses for the programming language in the German-speaking region. In May, 2019, we bring our popular courses „Introduction to R“ and „Introduction to Machine Learning with R“ to Hamburg again. What you can look forward to? Our program at a glance:   May 14th – 15th|Introduction to … Read more

Categories R Tags ExcerptFavorite

Customers who bought…

One of the classic examples in data science (called data mining at the time) is the beer and diapers example: when a big supermarket chain started analyzing their sales data they encountered not only trivial patterns, like toothbrushes and toothpaste being bought together, but also quite strange combinations like beer and diapers. Now, the trivial … Read more

Categories R Tags ExcerptFavorite

Using Rstudio Jobs for training many models in parallel

Recently, Rstudio added the Jobs feature, which allows you to run R scripts in the background. Computations are done in a separate R session that is not interactive, but just runs the script. In the meantime your regular R session stays live so you can do other work while waiting for the Job to complete. … Read more

Categories R Tags ExcerptFavorite

Making thematic maps for Belgium

Related R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more… If you got this far, why not subscribe for updates from … Read more

Categories R Tags ExcerptFavorite