Analyzing data from COVID19 R package

[This article was first published on R | TypeThePipe, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. The idea behind this post was to play and discover some … Read more

Categories R Tags ExcerptFavorite

Polygons to hexagons

[This article was first published on r.iresmi.net, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Hexagon tessellation using the great geogrid package. The départements are the second level … Read more

Categories R Tags ExcerptFavorite

Dialogues with Databases

About the speaker Kathryn Hume leads product and business development for Borealis AI, the machine learning research lab for the Royal Bank of Canada. Prior to joining Borealis AI, Kathryn held leadership positions at integrate.ai and Fast Forward Labs (acquired by Cloudera). She has helped over 50 Fortune 500 companies develop machine learning applications and … Read more

Choosing between BigQuery on-demand and flat rate pricingChoosing between BigQuery on-demand and flat rate pricingSpecialist Customer Engineer, Data Analytics

Editor’s note: This is one installment in a series about managing BigQuery costs. Check out the other posts on using Reservations effectively and how to use Flex Slots to save on costs. When you use data to guide your business decision-making process, you need to continually optimize your data analytics usage to get more out … Read more

Why R? Webinar –

[This article was first published on http://r-addict.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. May 28th (8:00pm UTC+2) will bring another fascinating Webinar at Why R? Foundation. … Read more

Categories R Tags ExcerptFavorite

Virtual Build spotlights IoT updates and rollouts

As people around the globe adapt to new ways of working, the Microsoft Build 2020 conference took a new approach as well. Rather than gathering the developer community in person as planned, Microsoft shifted gears and put together 48 hours of streaming content for a virtual event. Despite the new format, Microsoft Build’s goals remained … Read more

Azure Lighthouse—managing cloud, hybrid, and edge environments at-scale through a single control plane

Thousands of partners and enterprises use Azure Lighthouse to manage services across Azure tenants, representing tens of thousands of subscriptions and more than one million Azure resources from Azure Resource Manager—a unified control plane. With Azure Lighthouse, service providers, as well as self-managing enterprises, can achieve higher operational efficiency using Azure’s comprehensive and robust management … Read more

End-to-End Machine Learning Project: Part-2

In the previous post, we saw how I trained an image classification model, starting from data preparation to training different iterations of the model, both using Convolutional Neural Networks(CNNs) and Transfer Learning to get a final model which classifies US dollar bills. If you haven’t already, I would suggest skimming through that post first and … Read more

Why you should consider automating your job search needs?

Using data extraction and automation techniques for identifying job opportunities Image by Author Disclaimer: The following article has been meant for educational purposes only. All the data extracted is publicly available on the Naukri.com website and can be accessed without any kind of login. If you want to read up about the legality of scraping, … Read more

The computers of nature

Cellular automata and the emergence of computation in natural systems The cloth of gold cone, a venomous species of sea snail. The pattern of its shell resembles that of a cellular automaton. (source: Wikipedia) One of the most important milestones in the history of computing was the idea of the universal Turing machine, a conceptualized … Read more

RSwitch 1.7.0 Has Been Released

[This article was first published on R – rud.is, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. I (and, apparently, Gandalf O_o) are pleased to announce that RSwitch … Read more

Categories R Tags ExcerptFavorite

4 Ways To Improve Your Plotly Graphs

These last weeks I’ve been working on an application using Dash and Plotly. These tools are great if you want to get something out quickly. But, as usual, there’s no magical make_beautiful_graphs parameter you can set to True by default. If you want to have beautiful and customized visualizations in your application, you’ll need to … Read more

StayAtHome — A Story of COVID-19

An Analysis of Trend and Perspective about StayAtHome Campaign Source https://unsplash.com/@anastasiiachepinska Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus [Source]. The first case of the novel coronavirus was reported in Wuhan, China, in late December 2019 [Source] and has spread globally since then. As a result, WHO has declared it … Read more

RTutor: How would Carbon Pricing Affect US Electricity Production?

One way by which carbon pricing can reduce CO2 emmissions is by shifting electricity production from coal power plants to natural gas plants. A combined cycle gas turbine emits roughly only half as much CO2 per MWh produced electricity than a coal power plant. (Unfortunately, a substantial part of those benefits may be negated unless … Read more

Categories R Tags ExcerptFavorite

Southern Africa Project 2020

[This article was first published on Blog on FORWARDS, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Using the satRday Johannesburg 2020conference as a springboard, Forwards set up … Read more

Categories R Tags ExcerptFavorite

Binary Classification Example

Finally, we have to scale the features in order to have all their values on the same range or magnitude. This step has to be done as most ML classifiers use Euclidean distance and features with higher magnitudes or range would have more influence on the prediction. For example temperature, 32 degree Fahrenheit is the … Read more

#3 T^4: Customizing The Shell

[This article was first published on Thinking inside the box , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. The third video (following the announcement, the shell colors) … Read more

Categories R Tags ExcerptFavorite

What is isotonic regression?

Isotonic regression is a method for obtaining a monotonic fit for 1-dimensional data. Let’s say we have data such that . (We assume no ties among the ‘s for simplicity.) Informally, isotonic regression looks for such that the ‘s approximate the ‘s well while being monotonically non-decreasing. Formally, the ‘s are the solution to the … Read more

Categories R Tags ExcerptFavorite

The Recurrent Neural Network (RNNs)

A recurrent neural network (RNN) is an input node (hidden layer) that feeds sigmoid activation. The way an RNN does this is to take the output of one neuron and return it as input to another neuron or feed the input of the current time step to the output of earlier time steps. Here you … Read more

RcppSimdJson 0.0.5: Updated Upstream

[This article was first published on Thinking inside the box , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. A new RcppSimdJson release with updated upstream simdjson code … Read more

Categories R Tags ExcerptFavorite

Serverless machine learning architecture on leading cloud platforms

Machine learning architecture on Amazon Web Services (AWS), Google Cloud Platform (GCP) and Microsoft Azure cloud platforms GCP, AWS, Azure The cloud platforms themselves have various services which can be mixed and matched to satisfy the need of any business case with the allocated budget. Here I am going to pick a generic example mentioned … Read more

Big Data 7: yorkr waltzes with Apache NiFi

In this post, I construct an end-to-end Apache NiFi pipeline with my R package yorkr. This post is a mirror of my earlier post Big Data-5: kNiFing through cricket data with yorkpy based on my Python package yorkpy. The  Apache NiFi Data Pipeilne  flows all the way from the source, where the data is obtained, … Read more

Categories R Tags ExcerptFavorite

AWS Solutions: Real-time Analytics with Spark Streaming now supports Spark SQL, Dataframes and more

The solution now includes an updated consumer application using the latest version of Spark and leverages modern features (such as Spark SQL and DataFrames), granular custom IAM policies, encryption at rest(default), flow logs to VPC, porting sample Spark streaming applications to Java (from Scala), and several maintenance upgrades such as updating Python to version 3.8 … Read more

Categories AWS ExcerptFavorite

Now Query for AWS Availability Zones and Local Zones using AWS Systems Manager Parameter Store

Parameter Store is a feature in Systems Manager providing secure, centralized storage for your application configuration data. AWS services publish information about common artifacts such as AMI IDs and Regions as public parameters. You can look up the available public parameters and call them from your scripts and code. With this launch, you can now … Read more

Categories AWS ExcerptFavorite

Create and convert tibbles

[This article was first published on Quantargo Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. Tibbles are the modern reimagination of data frames and share many commonalities … Read more

Categories R Tags ExcerptFavorite

AWS CodeBuild Test Reporting is now Generally Available

With Test Reporting, you can view historical trends, aggregated metrics, or details of test-failure for tests executed on AWS CodeBuild. These capabilities are supported for any test framework that outputs files for JUnit, Cucumber, TestNG, or TRX. Since the Test Reporting Beta, announced in November, 2019, we’ve introduced support for Tagging on Report Groups and … Read more

Categories AWS ExcerptFavorite

Data and Decision Models — How to use influence diagrams in a Data Science Project?

A step by step approach towards building an influence diagram Photo by NEW DATA SERVICES on Unsplash “Value Chain” is a widely used term that defines five key areas namely requirement analysis, design, implementation, testing, and evolution. Several such process models are relevant to data science however there is no right answer to which is … Read more

Custom Package Repositories in R

by Steve Belcher, Sr Technical Specialist, Microsoft Data & AI In some companies, R users can’t download R packages from CRAN. That might be because they work in an environment that’s isolated from the internet, or because company policy dictates that only specific R packages and/or package versions may be used. In this article, we … Read more

A complete guide to transfer learning from English to other Languages using Sentence Embeddings…

So far, we can see that SBERT can be used for information retrieval, clustering, automatic essay scoring, and for semantic textual similarity with incredible time and high accuracy. However, the limitation of SBERT is that it only supports English at the moment while leave blank for other languages. To solve that, we can use the … Read more

Model Selection in Text Classification

Source: Image by author At the beginning, there was a simple problem. My manager came to me to ask if we could classified mails and associated documents with NLP methods. Sound not very funky but I’ll start with thousands of sample. The first thing asked was to use “XGBoost” because: “We can do everything with … Read more

Data Pipeline in GCP: Cloud Function Basics

Source: Google Cloud Most Data Scientists, prefer to own the end to end data pipeline of their models, but owning a pipeline requires a lot of engineering effort. In this article, I will talk about Cloud Function, which is a serverless, easy, and cost-effective option. GCP provides a simple scheduling tool called “Cloud Scheduler”. From … Read more

AdaOpt (a probabilistic classifier based on a mix of multivariable optimization and a nearest neighbors) for R

Last week on this blog, I presented AdaOpt for Python on a handwritten digits classification task. AdaOpt is a novel probabilistic classifier, based on a mix of multivariable optimization and a nearest neighbors algorithm. It’s still very new and only time will allow to fully appreciate all of its features. The tool is fast due … Read more

Categories R Tags ExcerptFavorite

a new Monty Hall riddle

[This article was first published on R – Xi’an’s Og, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) Want to share your content on R-bloggers? click here if you have a blog, or here if you don’t. The Riddler was sort of feeling the rising boredom of … Read more

Categories R Tags ExcerptFavorite

AWS Marketplace launches rapid data delivery for Sellers and Consulting Partners

Starting today, sellers can review and analyze critical financial, offering, tax and buyer information within one day. For example, disbursements data will be available within a day after the disbursement is complete. Sellers can ingest the data fields into AWS tools, such as Amazon Athena or Amazon QuickSight, or into their organization’s preferred reporting tools … Read more

Categories AWS ExcerptFavorite

Zero-trust remote admin access for Windows VMs on Compute EngineZero-trust remote admin access for Windows VMs on Compute EngineProduct Manager, Google CloudSolutions Architect

It’s more important than ever for IT administrators to be able to securely access resources from wherever they are. Exposing VM instances to the public internet can be risky, potentially giving bad actors a direct access path to your network. But solutions such as VPN tunnels or jump (bastion) hosts to access these systems can … Read more

Burst data lake processing to Dataproc using on-prem Hadoop dataBurst data lake processing to Dataproc using on-prem Hadoop dataLead Engineer at Alluxio

Many companies have data stored in a Hadoop Distributed File System (HDFS) cluster in their on-premises environment. As the amount of stored data grows, and the number of workloads coming in from analytics frameworks like Apache Spark, Presto, Apache Hive, and more grow, this type of fixed on-premises infrastructure becomes costly and causes latency in … Read more

Cloud Functions, meet VPC functionalityCloud Functions, meet VPC functionalitySolution ArchitectProduct Manager

Ingress settings and VPC Service Controls With the release of Serverless VPC Access, we provided an egress path from functions to services running in a VPC (e.g.,  stateful services running on VMs without public IP addresses or services such as Memorystore). Now with support for ingress settings, you have more control over which network requests … Read more

3 Types of Sequence Prediction Problems

You can try to build your time-series forecasting model with LSTM or ARIMA on the Air Passengers dataset or try out this TensorFlow.js demo. Webpage/product recommendation Have you searched for something, and every advertisement you saw next is related to what you searched for? For example, after watching the movie Avengers: Endgame, I was searching … Read more

Meeting the challenges of today and tomorrow with Azure AI

It’s inspiring to see how customers continue to reimagine how they work with the help of AI, which is more important today than ever. Our customers are finding innovative ways to deliver crisis management solutions, drive cost-savings, redefine customer engagement, and accelerate decision-making. Here are some notable examples we’ve recently seen: Scaling crisis management On … Read more

Azure Synapse Analytics: Insights for all

In November we announced Azure Synapse Analytics—a limitless analytics service that brings together enterprise data warehousing and big data analytics. Despite the massive disruptions occurring throughout industries in recent months, we continue to see a high level of excitement and enthusiasm for joining the private preview to use analytics in these times of crisis. Thank … Read more

Azure Stack updates and how it enables intelligence at the edge

Today, more than ever before, it is essential that our colleagues, customers, and partners be able to react quickly and confidently to rapidly changing circumstances. The ability to ingest, analyze, and act on incoming information requires that an organization have a robust, scalable technology infrastructure. Such an infrastructure is not limited to one place or … Read more