Scraping a table in a PDF, reliably and then test data quality

How to scrape a table within a PDF in Python, unit test the data for quality, and then upload it to S3. Suppose you need to ingest some data into your data warehouse and, after further discussions with your stakeholders, you learn that the source of this data is a PDF document. …
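The "unit test the data for quality" step can be sketched as below. This is a minimal illustration, not the article's own code: it assumes the table has already been extracted into a list of dicts, and the column names `name` and `amount` are hypothetical.

```python
def check_rows(rows, required=("name", "amount"), numeric=("amount",)):
    """Return a list of human-readable problems found in scraped table rows."""
    problems = []
    for i, row in enumerate(rows):
        # Every required column must be present and non-empty.
        for col in required:
            if row.get(col) in (None, ""):
                problems.append(f"row {i}: missing {col}")
        # Numeric columns may use thousands separators but must be whole numbers.
        # (Decimals and negative values are rejected by this simple check.)
        for col in numeric:
            value = row.get(col)
            if value in (None, ""):
                continue
            if not str(value).replace(",", "").isdigit():
                problems.append(f"row {i}: non-numeric {col} {value!r}")
    return problems


if __name__ == "__main__":
    scraped = [
        {"name": "Widget", "amount": "1,200"},
        {"name": "", "amount": "n/a"},
    ]
    for problem in check_rows(scraped):
        print(problem)
```

Running a check like this before the upload to S3 means a malformed extraction fails loudly instead of landing in the warehouse.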

Amazon AppFlow now supports AWS CloudFormation

Amazon AppFlow now supports AWS CloudFormation for creating and configuring Amazon AppFlow resources such as Connector profile and Amazon AppFlow Flow along with the rest of your AWS infrastructure, in a secure, efficient, and repeatable way. Amazon AppFlow is a fully managed integration service that enables customers to securely transfer data between AWS services and software-as-a-service …

Machine Learning on AWS SageMaker

Before we jump into this, let’s explain what we need to have in place. I’ll be quick, promise! Setup preparation: Amazon S3. Amazon S3 is a storage service that lets us store and protect our data in containers called buckets. We will need this service to go forward. Buckets: a bucket is a container for objects stored …

AWS Secrets Manager has been OSPAR assessed and approved

Security and compliance, including OSPAR, is a shared responsibility between AWS and you. For example, it is your responsibility to configure and manage secrets stored in Secrets Manager to meet ABS Guidelines. To learn more about the actions you may need to take to meet ABS Guidelines, read the AWS Cloud Compliance and OSPAR compliance …

gVisor: Protecting GKE and serverless users in the real world
VP Infrastructure and Fellow, Google Cloud

gVisor takes inspiration from a common principle in security that states that you should have multiple distinct layers of protection, and that those layers should not be susceptible to the same kinds of compromises. Containers rely on namespaces and cgroups as their primary layer of isolation; gVisor then introduces a second layer by handling syscalls …

Amazon Comprehend now helps you mask personally identifiable information from text documents

Amazon Comprehend is a natural language processing (NLP) service that uses machine learning to find insights and relationships in text. It provides pre-trained models for recognizing entities, key phrases, sentiments, and other common elements in a document. You can also build custom models with Amazon Comprehend to recognize custom entities and classify documents. Amazon Comprehend …

Tips and tricks for using new RegEx support in Cloud Logging
Software Engineer; Product Manager, Google Cloud

Pro tip: you can paste a timestamp like the one below directly into the field for custom time. Original search: "CONNECTING". Specific search: timestamp>="2019-08-05T18:34:19.856588299Z" timestamp<="2019-09-05T18:34:19.856588299Z" "CONNECTING". Put highly-queried data into indexed fields. You can use the Cloud Logging agent to route log data to indexed fields for improved performance, for example. Placing indexed data in the “labels” LogEntry …
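The time-bounded search in the tip can be assembled programmatically. The sketch below is an illustration only: `time_bounded_filter` is a hypothetical helper, it assumes the datetimes passed in are already in UTC, and it truncates timestamps to whole seconds rather than the nanosecond precision shown above.

```python
from datetime import datetime, timezone


def time_bounded_filter(text, start, end):
    """Build a Cloud Logging query that narrows a text search to a time window."""
    fmt = "%Y-%m-%dT%H:%M:%SZ"  # RFC 3339 style, assuming UTC inputs
    return (
        f'timestamp>="{start.strftime(fmt)}" '
        f'timestamp<="{end.strftime(fmt)}" '
        f'"{text}"'
    )


if __name__ == "__main__":
    query = time_bounded_filter(
        "CONNECTING",
        datetime(2019, 8, 5, 18, 34, 19, tzinfo=timezone.utc),
        datetime(2019, 9, 5, 18, 34, 19, tzinfo=timezone.utc),
    )
    print(query)
```

The three terms are separated by spaces, which the query language treats as an implicit AND.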

Data warehouse migration tips: preparation and discovery
EMEA Solution Lead, Data Analytics

Data warehouses are at the heart of an organization’s decision-making process, which is why many businesses are moving away from the siloed approach of traditional data warehouses to a modern data warehouse that provides advanced capabilities to meet changing requirements. At Google Cloud, we often work with customers on data warehouse migration projects, including …

Amazon Kinesis Data Analytics is now available in the Europe (Milan) AWS region

Amazon Kinesis Data Analytics is the easiest way to transform and analyze streaming data in real time with Apache Flink. Apache Flink is an open source framework and engine for processing data streams. Amazon Kinesis Data Analytics reduces the complexity of building and managing Apache Flink applications. Amazon Kinesis Data Analytics for Apache Flink integrates …

Export data from Cloud SQL without performance overhead
Product Manager, Google Cloud Platform

While there are a variety of reasons to export data out of your databases – such as to maintain backups, meet regulatory data retention policies, or feed downstream analytics – exports can put undue strain on your production systems, making them challenging to schedule and manage. To eliminate that resource strain, we’ve launched a new …

Azure Container Instances – Docker integration now in Docker Desktop stable release

We’re happy to announce the new stable release of Docker Desktop includes the Azure Container Instances – Docker integration. Install or update to the latest release and get started deploying containers to Azure Container Instances (ACI) today. Azure Docker integration: The Azure Docker integration enables you to deploy serverless containers to Azure Container Instances (ACI) …

Build a scalable security practice with Azure Lighthouse and Azure Sentinel

The Microsoft Azure Lighthouse product group is excited to launch a blog series covering areas in Azure Lighthouse where we are investing to make our service provider partners and enterprise customers successful with Azure. Our first blog in this series covers a top area of consideration for companies worldwide: security, with a focus on how Azure Lighthouse …

Azure NetApp Files cross region replication and new enhancements in preview

As businesses continue to adapt to the realities of the current environment, operational resilience has never been more important. As a result, a growing number of customers have accelerated a move to the cloud, using Microsoft Azure NetApp Files to power critical pieces of their IT infrastructure, like Virtual Desktop Infrastructure, SAP applications, and mission-critical …

Deutsche Börse Group continues its journey to the cloud
Managing Director, Google Cloud DACH; General Manager, Google Cloud Compute

The word “transformation” brings many things to mind, like innovation, agility, and change. Consistency and stability are probably not as high on the list of synonyms, but for regulated industries undergoing digital transformation initiatives, those characteristics are just as critical; in fact, they’re critically important for digital transformation to succeed. Deutsche Börse Group, an international financial …

Amazon Route 53 Resolver Now Supports VPC DNS Query Logging in AWS GovCloud (US) Regions

Route 53 Resolver is the Amazon DNS server (also sometimes referred to as “AmazonProvidedDNS” or the “.2 resolver”) that is available by default in all Amazon VPCs. Route 53 Resolver responds to DNS queries from AWS resources within a VPC for public DNS records, Amazon VPC-specific DNS names, and Amazon Route 53 private hosted zones. …
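The “.2 resolver” nickname comes from the resolver's address: the base of the VPC's primary IPv4 CIDR block plus two. A small sketch of that arithmetic follows; `vpc_resolver_address` is a hypothetical helper name, and the CIDR blocks are example values.

```python
import ipaddress


def vpc_resolver_address(vpc_cidr):
    """Return the AmazonProvidedDNS address for a VPC's primary IPv4 CIDR."""
    network = ipaddress.ip_network(vpc_cidr)
    # The resolver sits at the network base address plus two.
    return str(network.network_address + 2)


if __name__ == "__main__":
    for cidr in ("10.0.0.0/16", "172.31.0.0/16"):
        print(cidr, "->", vpc_resolver_address(cidr))
```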

NFS 4.1 support for Azure Files is now in preview

Azure Files is a distributed cloud file system that serves the SMB and REST protocols and has been generally available since 2015. Customers love how Azure Files enables them to easily lift and shift their legacy workloads to the cloud without any modifications or changes in technology. SMB works great on both Windows and UNIX operating systems for …

Preparing for what’s next: Building landing zones for successful cloud migrations

As businesses look to the cloud to ensure business resiliency and to spur innovation, we continue to see customer migrations to Azure accelerate. Increasingly, we’ve heard from business leaders preparing to migrate that they could learn from our best practices and want general help thinking about migration, and we started a blog series to help share …

Amazon AppFlow now supports new data formats for ingesting files into Amazon S3

Amazon AppFlow, a fully managed integration service that enables customers to securely transfer data between AWS services and software-as-a-service (SaaS) applications, now offers customers the flexibility to choose JSON, comma-separated values (CSV), or Parquet as the file format when transferring data from a source application to Amazon S3. This feature is supported for all source …

Analyze your logs quickly with suggested queries beta in Cloud Logging
Product Manager

Cloud Logging is a popular tool to help developers, operators, and other users identify and find the root cause of issues in their infrastructure. With features like the Logs Explorer, you can quickly and efficiently retrieve, view, and analyze logs. To help you get the most out of your logs, we’re excited to introduce suggested …

Better outcomes with AI: Frost & Sullivan names Microsoft the leading AI platform for healthcare IT

In early 2020, Frost & Sullivan recognized Microsoft as the “undisputed leader” in global Artificial Intelligence (AI) platforms for the Healthcare IT (HCIT) sector on the Frost Radar™. In a field of more than 200 global industry participants, Frost & Sullivan independently plotted the top 20 companies across various parameters indicative of growth and innovation, …

Discord notification using CloudWatch Alarms, SNS and AWS Lambda

Select Metric: First of all, you will need to choose a CloudWatch metric for the alarm to watch. For the Lambda function there are three types of metrics. Invocation metrics: binary indicators of the outcome of an invocation. Examples: Invocations, Errors, DeadLetterErrors, DestinationDeliveryFailures, Throttles. Performance metrics: performance details about a single invocation, such as Duration, …
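Once an alarm on one of these metrics fires, SNS delivers it to the Lambda function as a JSON string inside the event. The sketch below shows only the formatting step and is not taken from the article: `build_discord_payload` is a hypothetical helper, and actually posting the result to a Discord webhook URL is left out.

```python
import json


def build_discord_payload(event):
    """Turn an SNS-delivered CloudWatch alarm event into a Discord webhook body."""
    # SNS wraps the alarm as a JSON string in Records[0].Sns.Message.
    alarm = json.loads(event["Records"][0]["Sns"]["Message"])
    content = (
        f'{alarm["AlarmName"]} is {alarm["NewStateValue"]}: '
        f'{alarm["NewStateReason"]}'
    )
    # Discord webhooks accept a JSON object with a "content" field.
    return {"content": content}


if __name__ == "__main__":
    sample_event = {
        "Records": [
            {
                "Sns": {
                    "Message": json.dumps(
                        {
                            "AlarmName": "lambda-errors",
                            "NewStateValue": "ALARM",
                            "NewStateReason": "Threshold crossed",
                        }
                    )
                }
            }
        ]
    }
    print(build_discord_payload(sample_event))
```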

Amazon S3 bucket owner condition helps to validate correct bucket ownership

S3 request APIs can now include an optional bucket owner condition parameter containing an AWS account ID, which helps customers verify that the specified account ID is associated with the bucket they are communicating with. When the bucket owner condition is used, S3 API requests will only succeed if the bucket owner matches the …
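In boto3 the condition is passed as the `ExpectedBucketOwner` request parameter. The sketch below only builds the request arguments, so it runs without AWS credentials; `get_object_params` is a hypothetical helper, and the bucket name, key, and account ID are placeholder values.

```python
def get_object_params(bucket, key, expected_owner=None):
    """Build keyword arguments for an S3 GetObject call with an owner check."""
    params = {"Bucket": bucket, "Key": key}
    if expected_owner is not None:
        # AWS account IDs are 12-digit strings; reject anything else early.
        if not (expected_owner.isdigit() and len(expected_owner) == 12):
            raise ValueError("expected_owner must be a 12-digit account ID")
        params["ExpectedBucketOwner"] = expected_owner
    return params


if __name__ == "__main__":
    print(get_object_params("example-bucket", "report.csv", "111122223333"))
```

With a boto3 S3 client named `s3`, the result would be used as `s3.get_object(**params)`; the request is rejected if the bucket belongs to a different account.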

Amazon CloudWatch Synthetics now supports enhanced monitoring for Broken Link and GUI Workflow Blueprints

Broken password reset links or misconfigured buttons that prevent customers from taking an action often go unnoticed unless reported by end customers. With CloudWatch Synthetics, you can continuously verify your customer experience even when there is no customer traffic on your web applications. This lets you discover issues before your customers do and react quickly to …

Faster, more powerful apps for everyone: What happened at Next OnAir this week
Director, Product Marketing, Data and Business Application Platform

Week nine of Google Cloud Next ’20: OnAir is in the books, and what a week it was! From Google Cloud GM/VP Amit Zavery’s keynote to product announcements to customer demos to Google SVP Urs Hölzle’s presentation and Q&A, we explored multiple ways enterprises are leveraging APIs and no-code application development to accelerate their digital transformations. …

Analytics get smarter for SAP customers with Informatica and Google Cloud
Partner Technical Lead for SAP Strategy & Architecture

Like many businesses around the world, SAP customers are adapting to new realities brought about not only by the current health crisis, but also by sudden competitive landscape shifts. As they pivot their business models and face major decisions, they need the ability to dig into massive amounts of data using analytics. But integrating enterprise …

Lost in translation: encryption, key management, and real security
Head of Solutions Strategy; Product Manager

Being compliant does not mean you’re secure. The definition of “being secure” varies too much across industries, organizations, and threat profiles (secure against what?) to ever match an external checklist. Because of this, compliance and security have developed a peculiar relationship. Regulation is meant to provide guidance, through accepted best practices, from the body of knowledge for …

Announcing Data API for Amazon Redshift

Amazon Redshift can now be accessed using the built-in Data API, making it easy to build web services-based applications and integrate with services including AWS Lambda, AWS AppSync, and AWS Cloud9. The Redshift Data API simplifies data access, ingest, and egress from languages supported by the AWS SDK, such as Python, Go, Java, Node.js, PHP, Ruby, and …

Spanning the globe with Google Cloud VMware Engine
Group Product Manager, Google Cloud VMware Engine

VMware Engine is helping customers such as Mitel accelerate their cloud journeys and unlock new growth opportunities. “Google Cloud VMware Engine shortened our implementation cycle — we can move our customers in just a few weeks (not months) to a well known, stateful environment in Google Cloud,” said Rick Cirigliano, Senior VP of Cloud Operations …

Amazon EKS now supports assigning EC2 security groups to Kubernetes pods

Previously, all pods on a node shared the same security groups. While IAM roles for service accounts solve the pod-level security challenge at the authentication layer, many organizations’ compliance requirements also mandate network segmentation as an additional defense-in-depth step. Kubernetes network policies provide an option for controlling network traffic within the cluster, …

Azure Files SMB Multichannel provides improved performance for clients

Server Message Block (SMB) 3.0 introduced SMB Multichannel technology for Windows Server 2012 and Windows 8 clients. This feature allows SMB 3.x clients to establish multiple network connections to an SMB 3.0 server for greater performance over multiple network adapters and/or by taking advantage of NIC Receive Side Scaling (RSS). Today, we are announcing the …

How APIs and ecosystem strategies accelerate digital transformation
Director of Product Management, Google Cloud

Enabling New Business Models: Many Google Cloud customers demonstrate the power of ecosystem strategies to expand innovation efforts and unlock new business models. Indonesia’s Bank BRI, for example, made capabilities such as credit scoring available via an API as a paid service, which helped it to develop a digital network of “branchless” agents across many …

EKS Now Supports Creation and Management of Fargate Profiles Using AWS CloudFormation

EKS Fargate profiles define which pods for your Amazon EKS clusters run on AWS Fargate, the AWS managed compute engine for containers. Previously, it was only possible to create and manage Fargate profiles using the EKS API or Console. Now, you can create and manage Fargate profiles using AWS CloudFormation. This means that you …

Google Cloud named a Leader in first Forrester Wave: Public Cloud Development & Infrastructure Platforms for ANZ
Vice President, ANZ at Google Cloud

The events of 2020 have only reinforced what we’ve heard from organisations across Australia and New Zealand (ANZ): they’re turning to the cloud because they want to innovate faster, increase resilience, collaborate more efficiently, and deliver more value to their customers. Today, we’re proud to share that Google Cloud has been recognised by Forrester as a …

The Story of Data — Privacy By Design

A discussion of the need to adopt frameworks like Privacy By Design very early in your data management life cycle. Every byte of data has a story to tell. The question is whether the story is being narrated accurately and securely. Usually, we focus sharply on the trends around data with a goal of …

Expanding Google Cloud’s Confidential Computing portfolio
General Manager/VP of Engineering, Cloud Security; General Manager/VP of Engineering, Application Modernization Platform

Bringing confidential computing to your container workloads: As our customers move to modernize existing applications and build cloud-native ones, GKE is increasingly the foundation they use. Application modernization also presents the opportunity to modernize security, and as we looked at building our Confidential Computing portfolio, we wanted to deliver a new level of confidentiality and …

Simplify financial reporting with cost allocation—now in preview

Managing cloud costs can be challenging, especially if your organization needs to break down costs for internal chargeback. You might have separate business units, or you might need to facilitate external billing for distinct customer solutions. This becomes even more difficult when you employ shared services to reduce costs, since there may not be a …

View your Azure Cache for Redis data in new Visual Studio Code extension

Azure Cache for Redis is an in-memory data store that is used to power fast, scalable applications. Now in preview, you can access all the caches under your Azure subscriptions and view their data with the new Azure Cache for Redis Visual Studio Code extension. With this new integration, you’ll be able to use Visual …

AWS Launch Wizard now supports SAP deployments with SUSE Linux Enterprise Server 15 SP1 and 12 SP5

AWS Launch Wizard offers a guided way of sizing, configuring, and deploying AWS resources for SAP HANA and SAP HANA-based NetWeaver systems with a purpose-built, easy-to-use wizard. The following table shows all of the operating systems currently supported for different SAP components that can be deployed with AWS Launch Wizard: AWS Launch …

Meetings readiness checker APIs help developers ensure that end-users can join Amazon Chime SDK meetings from their devices

From the Amazon Chime SDK for JavaScript, a developer can call any of the nine meeting readiness checker methods. These consist of local tests for device setup and network tests that confirm the application can connect to Amazon Chime by briefly joining and leaving a test Amazon Chime SDK meeting. When executing network tests, the …

Amazon Lightsail now offers new OS blueprints

In addition to providing compute instances preinstalled with your favorite OS, Lightsail bundles include storage and a generous amount of data transfer, so you have everything you need to get up and running, all for a fixed monthly price. After your bundles are deployed, Lightsail’s intuitive management console makes it easy to track metrics, create …

Building a COVID-19 Map using ELK

Most of you are probably familiar with the Johns Hopkins University (JHU) map representing the current situation of the COVID-19 pandemic. This map was developed using ArcGIS technology, which has become the de facto standard for developing pandemic maps in a lot of cases like …

AWS X-Ray launches anomaly detection-based actionable insights in preview

With this feature, you can determine the root cause of the issue, visualize the upstream and downstream services affected by the anomaly, and understand its impact on your end users. You can also view the incident timeline to understand when the issue started and how it progressed. AWS X-Ray Insights is available in the following …

An AI gold mine: What happened at Google Cloud Next ’20 OnAir
Product Marketing Lead, Cloud AI

AI and machine learning (ML) tools and solutions are fundamentally changing how businesses are run. Soon, organizations that are not using AI will be at a disadvantage compared with those that are, and applications that are not AI-powered may feel broken. This week at Google Cloud Next ’20: OnAir we explored how Cloud AI is empowering …

Amazon CloudFront announces support for TLSv1.3 for viewer connections

Better Performance: TLSv1.3 provides better performance with a simpler handshake process that requires fewer round trips. TLSv1.3 requires one round trip (1-RTT), compared to TLSv1.2, which requires two round trips (2-RTT), to negotiate a new secure connection. This translates into real-world performance improvements with lower first byte latency. In our own internal tests in the US region …
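The round-trip difference can be put into numbers with a rough sketch. The 80 ms round-trip time is an assumed example value, the estimate ignores TCP setup and server processing, and both function names are hypothetical. The second function shows how a Python client could insist on TLS 1.3 using the standard `ssl` module.

```python
import ssl


def handshake_delay_ms(rtt_ms, round_trips):
    """Time spent on the TLS handshake alone, given the network round-trip time."""
    return rtt_ms * round_trips


def tls13_only_context():
    """Client-side context that refuses any protocol older than TLS 1.3."""
    context = ssl.create_default_context()
    context.minimum_version = ssl.TLSVersion.TLSv1_3
    return context


if __name__ == "__main__":
    rtt = 80  # assumed round-trip time in milliseconds
    saving = handshake_delay_ms(rtt, 2) - handshake_delay_ms(rtt, 1)
    print(f"Estimated handshake saving at {rtt} ms RTT: {saving} ms")
```

Because the saving equals one full round trip, viewers far from the edge location stand to gain the most.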

What is API-first? 5 opportunities to create business value
Digital Engagement Lead, Google Cloud; Director of Product Management, Google Cloud

Our recent CIO survey with Oxford Economics contained a few takeaways that stood out to me: most companies are using API-first strategies, and those most committed to this concept report faster innovation and greater value from business partnerships. Even so, the survey indicates that a healthy minority of enterprises still think of APIs in integration-first …

Presto Federated Queries

Getting Started with Presto Federated Queries using Ahana’s PrestoDB Sandbox on AWS. According to The Presto Foundation, Presto (aka PrestoDB), not to be confused with PrestoSQL, is an open-source, distributed, ANSI SQL compliant query engine built for running interactive, ad-hoc analytic queries against data sources of all sizes ranging from …

AWS Cost & Usage Report now offers Monthly Granularity

We are excited to announce that management (payer) accounts can now set up AWS Cost & Usage reports at a monthly level. The AWS Cost & Usage Report contains the most comprehensive set of billing data available. In addition to the amount and corresponding cost of your AWS service usage, it also includes metadata such …

Next OnAir: Business application platform sessions to accelerate digital transformation

In 2020, digital transformation journeys have been forced to accelerate and what would normally have taken years is now happening in a matter of months. From the proliferation of telehealth services to the growing primacy of online retail to the rise of digital management, forecasting, and contingency-planning, enterprises face more pressure than ever to digitally …