Facts About Coronavirus Disease 2019 (COVID-19) in 5 Charts created with R and ggplot2

[This article was first published on novyden, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Introduction Coronovirus pandemic is changing our lifestyle from daily routine to near- and … Read more

Categories R Tags ExcerptFavorite

Concurrency in Python

After this fairly long, but necessary explanation, we are ready for some example code and experiments. Let’s get to work! Our test function We first define a function that we can use to benchmark our different options. All the following examples use the same function, called heavy : def heavy(n, myid):for x in range(1, n):for … Read more

The 2020 US Stock Market Drop was bad

Analyzing the severity of the COVID-19 related US stock market sell-off using data visualizations and a historic context. If you don’t know about the US stock market drop that occurred over the last 2 months, then you’re either: A college student, OR An entry-level worker with no investments and student loans worth a lifetime Either … Read more

The Impact of Paper Filters on Espresso

Vis-à-vis Aeropress Filters meet Espresso Machines In May of 2019, Scott Rao popularized an espresso technique for high extraction (>25% vs 18% to 22% range for a 3:1 shot) involving Aeropress filters (filtered espresso). At the time, I was deep into an exploration of the staccato espresso technique, and I didn’t want to add another … Read more

Less Known but Very Useful Pandas Functions

Expedite your data analysis process Photo by Erik Mclean on Unsplash Pandas is a highly-efficient and widely used data analysis tool. The core data structure of Pandas is DataFrame which represents data in tabular form with labeled rows and columns. There are many powerful and flexible functions and methods of DataFrame that ease and expedite … Read more

Bioinformatics: How AI Can Contribute to the Study of Life

Often, complex biological phenomena involve many biological aspects, and hence cannot be explained in one data type. For this reason, bioinformatics involves an interlinked analysis of several different data types, and should give a holistic understanding of complicated biological phenomena. Machine learning has been used in bioinformatics for prediction and discovery — with the rise … Read more

AWS Storage Gateway adds audit logs for File Gateway to address enterprise compliance requirements

File Gateway provides on-premises applications with file-based, cached access to virtually unlimited cloud storage using SMB and NFS protocols. With this launch, SMB client operations for files and folders are logged to provide IT administrators and compliance managers with the information they need about user access to files and folders. Key operations logged for files … Read more

Categories AWS ExcerptFavorite

My First Month As A Computer Vision Engineer

The key deliverable for the company comes in the form of an iOS mobile application. Photo by Roozbeh Eslami on Unsplash I’ve had no exposure to developing an iOS mobile application, so my first month involved me getting familiar with Xcode, which is an IDE for developing mobile apps, and also Swift, which is a … Read more

Python and computer vision

First of all, I want to give credit to Adrian Rosebrock and his amazing website, which I highly recommend you to explore in case you want to learn more about computer vision. Furthermore, this article assumes that you have already completed the installation of dlib . If not, do not worry, details can be found … Read more

foreach 1.5.0 now available on CRAN

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. This post is to announce that version 1.5.0 of the foreach package is … Read more

Categories R Tags ExcerptFavorite

Introducing Service Directory: Manage all your services in one place at scaleIntroducing Service Directory: Manage all your services in one place at scaleSoftware EngineerProduct Manager

Simplify service management and operations Service Directory reduces the complexity of management and operations by providing unified visibility for all your services across cloud and on-premises environments. And because Service Directory is fully managed, you get enhanced service inventory management at scale with no operational overhead, increasing the productivity of your DevOps teams. At the … Read more

Scaling Machine Learning

About the speaker: Razvan Peteanu’s current role is Lead Architect — Machine Learning at TD Securities. He has 25 years of experience in software development, mostly in the financial industry. His focus over the last several years has been on building scalable machine learning solutions, in the cloud or on premise. About the talk: Some … Read more

COVID-19 in Belgium

The Novel COVID-19 Coronavirus is still spreading quickly in several countries and it does not seem like it is going to stop anytime soon as the peak has not yet been reached in many countries. Since the beginning of its expansion, a large number of scientists across the world have been analyzing this Coronavirus from … Read more

Blogging A to Z: The A to Z of tidyverse

[This article was first published on Deeply Trivial, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Announcing my theme for this year’s blogging A to Z! The tidyverse … Read more

Categories R Tags ExcerptFavorite

Visualizing decision tree partition and decision boundaries

Grant McDermott develop this new R package I had thought of: parttree parttree includes a set of simple functions for visualizing decision tree partitions in R with ggplot2. The package is not yet on CRAN, but can be installed from GitHub using: # install.packages(“remotes”) remotes::install_github(“grantmcdermott/parttree”) Using the familiar ggplot2 syntax, we can simply add decision tree boundaries to … Read more

Categories R Tags ExcerptFavorite

Updates to R GUIs: BlueSky, jamovi, JASP, & RKWard

[This article was first published on R – r4stats.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Graphical User Interfaces (GUIs) for the R language help beginners get … Read more

Categories R Tags ExcerptFavorite

Microsoft partners with the industry to unlock new 5G scenarios with Azure Edge Zones

Cloud, edge computing, and IoT are making strides to transform whole industries and create opportunities that weren’t possible just a few years ago. With the rise of 5G mobile connectivity, there are even more possibilities to deliver immersive, real-time experiences that have demanding, ultra-low latency, and connectivity requirements. 5G opens new frontiers with enhanced mobile … Read more

Contagiousness of COVID-19 Part I: Improvements of Mathematical Fitting (Guest Post)

Learning Machines proudly presents a guest post by Martijn Weterings from the Food and Natural Products research group of the Institute of Life Technologies at the University of Applied Sciences of Western Switzerland in Sion. The topic of this post will be the fitting with the R-package optim. Food? That sounds like a rather unlikely … Read more

Categories R Tags ExcerptFavorite

What is a dgCMatrix object made of? (sparse matrix format in R)

I’ve been working with sparse matrices in R recently (those created using Matrix::Matrix with the option sparse=TRUE) and found it difficult to track down documentation about what the slots in the matrix object are. This post describes the slots in a class dgCMatrix object. (Click here for full documentation of the Matrix package (and it … Read more

Categories R Tags ExcerptFavorite

Master Auto ML in Python — An Overview of the MLBox package

Learn about MLBox to quickly and efficiently train an automated machine learning pipeline for a classification problem in python. Photo by Crystal Kwok on Unsplash Today’s post is very special. It’s written in collaboration with Axel de Romblay the author of the MLBox Auto-ML package that has gained a lot of popularity these last years. … Read more

Boosting Showdown: Scikit-Learn vs XGBoost vs LightGBM vs CatBoost in Sentiment Classification

Which gradient boosting library will reign supreme in this head-to-head competition? Image Source Gradient boosting is one of the most popular machine learning techniques in recent years, dominating many Kaggle competitions with heterogeneous tabular data. Similar to random forest (if you are not familiar with this ensembling algorithm I suggest you read up on it), … Read more

Realistic Deepfakes in 5 Minutes on Colab

Ladies and gentlemen, Deepfake videos are so easy to create, that anyone can make one. You do not need a PhD, and you don’t have to train models for hours, you don’t even have to take a course on Generative Adversarial Network. All that you have to do is, to record a video of yourself, … Read more

Recommending Scotch Whisky

About a year ago, I earned a certificate in selling Scotch Whisky from taking a training course in The Scotch Whisky Experience in Scotland. Since I have a background in data science, it leads to me to think about how to sell Scotch whisky with data science. Whisky-tasting in The Scotch Whisky Experience BackgroundWhisky is … Read more

Close Encounters of the R Kind

[This article was first published on R Views, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Affiliation Harrison – Center for Strategic and Budgetary Analysis, Washington DC Cara … Read more

Categories R Tags ExcerptFavorite

Can unbalanced randomization improve power?

[This article was first published on ouR data generation, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Of course, we’re all thinking about one thing these days, so … Read more

Categories R Tags ExcerptFavorite

Streamlining Design and Maximizing Success for Agile Test and Learn

Test and learn analysis is used widely throughout industry as a way to quantitatively understand the preferences of a retailer’s customer base. Successful testing requires accurate assumptions (response rates, anticipated difference between variants) along with finely tuned execution parameters (audience size, test/control split, test iterations and timing). The test design phase is arguably the most … Read more

Why we are so tempted to go out of lane?

Work has conditioned us to (Photo: Randy Au) Earlier last week I was seriously thinking about discussing how data people need to be very cautious about what they’re saying in public about COVID-19. There is a lot of contextless data and charts bouncing around that acts like a magnet to us data folk. The temptation … Read more

Image Recognition vs Other Techniques in Predicting the Financial Market

Power-up time series analysis with CNN and how it compares with other techniques This work was done in collaboration with @Vesko Karadjov who always sparkles with brilliant ideas. The financial market deals with time series mostly, and there are tons of techniques and tools around time series prediction. What we going to do is to … Read more

Recursion in Python

In computer science, recursion is a method of finding solutions to problems using smaller solutions of the same problem. Recursive algorithms have applications in list sorting, binary tree traversal, path finding and much more. In this post, we will discuss a classic recursive procedure used to find the factorial of a natural number. Let’s get … Read more

Analyzing Remote Sensing Data using Image Segmentation

[This article was first published on R-posts.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. This post summarizes material posted as an Additional Topic to accompany the book … Read more

Categories R Tags ExcerptFavorite

AWS Canada (Central) Region Adds Third Availability Zone

This third AZ provides customers with additional flexibility to architect scalable, fault-tolerant, and highly available applications in AWS Canada (Central) Region, and will support additional AWS services in Canada. This launch brings AWS’s global AZ total to 70 AZs within 22 geographic Regions around the world. See our newly released blog.   Availability Zones located … Read more

Categories AWS ExcerptFavorite

R Tip: How To Look Up Matrix Values Quickly

[This article was first published on R – Win-Vector Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. R is a powerful data science language because, like Matlab, … Read more

Categories R Tags ExcerptFavorite

Ace the SQL Data Science Interview in less than 10 minutes

When data is stored in different tables, it can be tricky to join it back together in the way that is most appropriate. The most common SQL question attempts to gage your understanding of the difference between left/right/inner/outer joins. The graph below summarizes the difference between each type of join: Typical Graph Highlighting Left/Right/Inner/Outer Join … Read more

COVID-19 public dataset on GCP from cases in Italy

I. Data extraction The starting point is a pdf document with medical notes and images. Here are the steps taken to extract that data: Download the pdfs from the ISMIR website (manual process) Separate the pages of each case (manual process) Extract text using Google Cloud Vision API — DOCUMENT_TEXT_DETECTION. Function to parse json output … Read more

COVID-19 public dataset program: Making data freely accessible for better public outcomesCOVID-19 public dataset program: Making data freely accessible for better public outcomesBigQuery Product Manager and GIS LeadDeveloper Advocate

Data always plays a critical role in the ability to research, study, and combat public health emergencies, and nowhere is this more true than in the case of a global crisis. Access to data sets—and tools that can analyze that data at cloud scale—are increasingly essential to the research process, and are particularly necessary in … Read more

En-Lightning Reinforcement Learning

Building a DQN with PyTorch Lightning This article looks at using PyTorch Lightning for the exciting domain of Reinforcement Learning (RL). Here we are going to build a standard Deep Q Network (DQN) model using the classic CartPole gym environment to illustrate how to start using Lightning to build your RL models. In this article … Read more

Why R? Webinars

[This article was first published on http://r-addict.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Motivated by a successful turnaround of visits of Why R? 2019 keynote talksthat … Read more

Categories R Tags ExcerptFavorite

Screenager: screening times at bioRxiv

[This article was first published on Rstats – quantixed, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. When a preprint is uploaded to bioRxiv, it undergoes screening before … Read more

Categories R Tags ExcerptFavorite

Extending the power of Azure AI to Microsoft 365 users

Today, Yusuf Mehdi, Corporate Vice President of Modern Life and Devices, announced the availability of new Microsoft 365 Personal and Family subscriptions. In his blog, he shared a few examples of how Microsoft 365 is innovating to deliver experiences powered by artificial intelligence (AI) to billions of users every day. Whether through familiar products like … Read more

Coronavirus : spatially smoothed decease in France

[This article was first published on r.iresmi.net, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. # Carto décès COVID 19 France # avec lissage # sources —————————————————————– fichier_covid … Read more

Categories R Tags ExcerptFavorite

Debugging Jupyter Notebooks Will Boost Your Productivity

Some would rightfully point out that Jupyter already supports pdb and that’s enough, but as developers, we tend to prefer the easy way AKA GUI. PixieDebugger is the first visual Python debugger for Jupyter notebooks and includes a source editor, local variable inspector, console output, the ability to evaluate Python expressions in the current context, … Read more