• Services

    Services

    Explore our range of services designed to drive innovation and transform your business with cutting-edge technology solutions tailored to your needs

    AI/ML

    Data Analytics

    Cybersecurity

    Sales & Commerce

    UI/UX

    Guidewire

    Duck Creek

    OTT

    Cloud

    Product Engineering

    FinTech

    Digital Marketing

    ADM

    “Partnering with tringapps transformed our digital strategy with cutting-edge solutions, enhancing performance, scalability, and security and boosting our efficiency by 50%. Their expertise in innovation and execution made a tangible impact on our success.”

  • Insights
  • About us
  • Careers
  • Contact us
logo
logoImage closeIcon
  • Services
    subMenuIndicator
  • Insights
  • About us
  • Careers
Share on LinkedIn Share on X (Twitter) Visit Instagram Share on Facebook
  • Services

Serverless data lake using AWS

  • 361 Views
  • 07 Oct 2022

The power of Data in determining operational agility and enterprise business value cannot be overlooked. Analytics performed over data sources acquired from click-streams, social media, internet-connected devices, and log files provide fast integration to improve time to insights, business growth, production boost, customer retention, and taking the right calls at the right time.

A serverless data lake is a popular system of storing and analyzing data in a single repository and features autonomous maintenance and architectural flexibility for diverse kinds of data. The purpose of the Data Lake is the democratization of access to Data across the organization.

Enterprises are now migrating to the public cloud for creating Data Lakes on platforms, particularly AWS. Some of the reasons include cost optimization, zero requirement of operational maintenance, large and cheap storage, faster time to market, competent serverless components on AWS, DR and BCP availability, faster scalability, better security, and much more.

An architecture for a cloud-native, serverless data lake using AWS native resources like S3, Athena, and Glue.

Serverless datalake architecture

Steps to create a quick data lake in AWS is as follows,

Create an S3 bucket to store the data,

S3 bucket

Let’s try to query a sample data set which is currently in CSV format,

dataset

Create a new folder called CSV and upload the CSV to that folder in the S3 bucket that was created earlier.

CSV File

AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development.

We will use AWS Glue to crawl the data to form the schema.

First, let’s add a new database.

database
database

Create a new crawler and use it to crawl the data that is stored in S3.

crawler

Add the S3 bucket as a source,

data source

Create an IAM role to have permissions to crawl the S3 data,

IAM role

Select the database which was created earlier,

target  database

Click on Review and Create.

Now once the crawler is created, click on “Run” to run the crawler,

Run the crawlers

Once the crawler has run, it will let us know the number of tables created,

created tables using crawlers

Check the schema of the table that was created,

schema

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

We will use Amazon Athena to query the data in S3,

Inside , set the query location

query result location

Using Athena – we can query the data inside S3 using standard SQL like below,

Athena

 

Athena result

To adapt and succeed, a technologically advanced organization must take advantage of every opportunity available to it. Today, no organization can afford to ignore the massive amount of Data at its disposal. A data lake provides unparalleled flexibility for unlocking data’s analytics potential.

Social Share
Prev Post Sustainability through Cloud Computing:…
Next Post Creating a connector to…

Related Post

Cloud Practise
01 Apr 2023

Navigating the Cloud: A Comprehensive Guide to Cloud Security

In the digital age, businesses are increasingly turning to cloud technology for…

04 Jan 2023

Six Ways to Reduce Your Cloud Bill

The vitality of any innovation is to make lives easier. Cloud computing…

Written by

Prashanth Gnanadesikan

Recent Articles
  • Fortifying Your Cloud: The Power of Minimal Human Interaction in Data Security
  • Cloud Security Unveiled: Crafting an Effective Incident Management Plan for a Secure Cloud
  • Safeguarding Your Cloud: Elevating Privacy and Security in the Digital Age
  • Unleashing the Power of Traceability in Cloud Security: A Comprehensive Guide
  • Building a Secure Cloud: The Importance of Identity Management
Search
Categories
  • AEM(2)
  • AI/ML(6)
  • Blogs(23)
  • Case study(37)
  • Cloud(6)
  • Cloud Computing(6)
  • Cloud Solutions(8)
  • Cost Optimization(1)
  • Cybersecurity(2)
  • Data Analytics(6)
  • Databricks(1)
  • eCommerce(3)
  • Guidewire(1)
  • Infrastructure(1)
  • OTT/Media(3)
  • SAP(1)
  • Serverless Computing(2)
  • Services(18)
  • Snowflake(3)
  • Support(1)
  • Technology(7)
Search Objects
Categories
  • AEM
  • AI/ML
  • Blogs
  • Case study
  • Cloud
  • Cloud Computing
  • Cloud Solutions
  • Cost Optimization
  • Cybersecurity
  • Data Analytics
  • Databricks
  • eCommerce
  • Guidewire
  • Infrastructure
  • OTT/Media
  • SAP
  • Serverless Computing
  • Services
  • Snowflake
  • Support
  • Technology
Popular Tags
data management data storage data warehouse vs data lake data warehouse vs data lakehouse

TRUSTED PARTNERSHIPS

OUR VALUED CLIENT

ic_eonline
ic_food_network
ic_kimberly_clark
ic_nbc
ic_overstock
ic_people
ic_realsimple
ic_reuters
ic_barclays
ic_scholastic
ic_sports_illustrated
ic_bloomberg
ic_cnbc
ic_wolter_kluwer
ic_entertainment_weekly
ic_jpmorgan
ic_bank_of_america
ic_decision_next
ic_HBOGO
ic_tribune_media
ic_Disnep Movie
ic_ap
ic_cedars_sinai
ic_chubb
ic_cinemax
ic_cnbc
ic_fidelity
ic_grio
ic_musc
ic_sopheon
ic_tact
ic_time
ic_nbc_universal
ic_zerosum
ic_gsk
ic_handlr
ic_hunt_killer
ic_jdrf
ic_kaplan
ic_kohl_s
ic_mobitv

GREAT OPPORTUNITY STARTS WITH A CONVERSATION

Contact Us

Experience the power of our cutting-edge technology firsthand

© 2025 TRINGAPPS, INC. ALL RIGHTS RESERVED

Services

AI/ML

Data Analytics

Cybersecurity

Sales & Commerce

UI/UX

Guidewire

Duck Creek

Services

AI/ML

Data Analytics

Cybersecurity

Sales & Commerce

UI/UX

Guidewire

Duck Creek


Product

Cloud

Product Engineering

FinTech

Digital Marketing

ADM


OTT

Cloud

Product Engineering

FinTech

Digital Marketing

ADM

Legal & Support

Terms and conditions​

Contact us

Cookie policy

Privacy policy

FAQ

Disclaimer

Company

About us

Careers

Legal & Support

Terms and conditions​

Contact us

Cookie policy

Privacy policy

FAQ

Disclaimer

Company

About us

Careers

SuccessIcon

Thank you!

Your message has been sent,
Our team will get back to you shortly.

Close