In this blog, I’ll show you how to incorporate several AWS services into a simple, serverless workflow. The idea is to create an SQS queue, then using an HTTP API to trigger a Lambda function that sends messages to the queue. What is serverless computing? Serverless computing is a means of offering on-demand backend services. Users may use…
Author: Jaya Keerthi Nannapaneni
Using python SDK to create S3 bucket, upload and download files from S3 bucket
This blog post will focus on creating the S3 bucket, uploading a file and downloading a file from the S3 bucket. AWS SDK for Python (Boto3) makes it easy to integrate your Python application to create, configure, and manage AWS services, such as Amazon Elastic Compute Cloud (Amazon EC2) and…
Azure Databricks
Overview: Azure Databricks is a simple, quick, and collaborative Apache Spark-based analytics platform. It boosts the innovation by bringing together data science, data engineering and business. Azure Databricks is a cloud-optimized version of Apache Spark that is one of the most powerful analytics platforms on the Azure Cloud. The topics…
Azure Databricks pricing
Back to Home Pay as you go: The cost of Azure Databricks is determined by the number of virtual machines managed in clusters and the number of Databricks Units specified. A Databricks Unit (DBU) is a processing facility unit that is invoiced on a per-second basis. DBU consumption is determined by…
Azure Databricks architecture and its components
Back to Home Architecture Dataflow: Azure Databricks ingests raw streaming data from Azure Event Hubs. Data Factory loads raw batch data into Data Lake Storage. For data storage Data Lake Storage houses stores data of all types, such as structured, unstructured, and semi-structured. It also stores batch and streaming data.…
Reasons why we use Databricks today
Back to home Databricks is an industry-leading, cloud-based data engineering tool used for processing and transforming massive quantities of data and exploring the data through machine learning models. Recently added to Azure, it’s the latest big data tool for the Microsoft cloud. Available to all organizations, it allows them to…
How to create a Databricks Workspace and cluster
Back to home To get one example of provisioning Instance on Azure Databricks, let’s start with this 8 minute video tutorial: Create your workspace and cluster: 1. Log in to the Azure portal Note: If you don’t have the Microsoft Azure account, create a new account using this blog: prepare…
Medallion architecture with Databricks
Back to the home What is a medallion architecture? A medallion architecture is a data design pattern used to logically organize data in a lakehouse, with the goal of incrementally and progressively improving the structure and quality of data as it flows through each layer of the architecture (from Bronze ⇒ Silver ⇒…
Features of Azure Databricks
Back to home We have seen what Azure Databricks is and the reasons why it is the best analytics tool. Now, let us move further with a few more details about the analytics tool. Here are some of the rich features of Azure Databricks, Optimized Apache Spark environment : It…