Open in app

Sign in

Medium Logo
Write

Sign in

Vinoth Chandar
Vinoth Chandar

105 followers

Home

About

bytearray

Published in

bytearray

Doing range gets on cloud storage for fun and profit

The thing that stands between good and great cloud read performance

Nov 28, 2023
1
Doing range gets on cloud storage for fun and profit
Doing range gets on cloud storage for fun and profit
Nov 28, 2023
1
bytearray

Published in

bytearray

Corrections in data lakehouse table format comparisons

A live document to serve as a point of reference for corrections for inaccuracies for different comparative studies of Hudi, Delta Lake, or…

Apr 20, 2022
2
Corrections in data lakehouse table format comparisons
Corrections in data lakehouse table format comparisons
Apr 20, 2022
2
apache-hudi-blogs

Published in

apache-hudi-blogs

Reliable ingestion from AWS S3 using Hudi

In this post we will talk about a new deltastreamer source which reliably and efficiently processes new data files as they arrive in AWS S3

Sep 2, 2021
Reliable ingestion from AWS S3 using Hudi
Reliable ingestion from AWS S3 using Hudi
Sep 2, 2021
apache-hudi-blogs

Published in

apache-hudi-blogs

Apache Hudi — The Streaming Data Lake Platform

This blog is a repost of the original blog here

Jul 27, 2021
Apache Hudi — The Streaming Data Lake Platform
Apache Hudi — The Streaming Data Lake Platform
Jul 27, 2021
apache-hudi-blogs

Published in

apache-hudi-blogs

Streaming Responsibly Into the Data Lake

How Apache Hudi maintains optimum sized files

Mar 15, 2021
Streaming Responsibly Into the Data Lake
Streaming Responsibly Into the Data Lake
Mar 15, 2021
apache-hudi-blogs

Published in

apache-hudi-blogs

Optimize Data Lake layout using Clustering in Apache Hudi

This blog is a repost of this Hudi blog on medium.

Jan 28, 2021
Optimize Data Lake layout using Clustering in Apache Hudi
Optimize Data Lake layout using Clustering in Apache Hudi
Jan 28, 2021
apache-hudi-blogs

Published in

apache-hudi-blogs

Employing the right indexes for fast updates, deletes in Apache Hudi

This blog is a repost of this Hudi blog on medium.

Dec 19, 2020
Employing the right indexes for fast updates, deletes in Apache Hudi
Employing the right indexes for fast updates, deletes in Apache Hudi
Dec 19, 2020
bytearray

Published in

bytearray

Apache Hudi (Incubating) Support on Apache Zeppelin

Reposted translation of the original article : https://mp.weixin.qq.com/s/_mNwL5uXSDYyqtLDPx0iDA

Apr 27, 2020
Apache Hudi (Incubating) Support on Apache Zeppelin
Apache Hudi (Incubating) Support on Apache Zeppelin
Apr 27, 2020

Embrace the Data Lake Architecture

Often times, data engineers build data pipelines to extract data from external sources, transform them and enable other parts of the…

Jun 8, 2019
Jun 8, 2019
bytearray

Published in

bytearray

Setting up Hadoop/YARN/Spark/Hive on Mac OSX

[Reposted from my blogger]

Sep 16, 2018
Sep 16, 2018
Vinoth Chandar

Vinoth Chandar

105 followers
Following
  • apache-hudi-blogs

    apache-hudi-blogs

  • onehouse-blogs

    onehouse-blogs

  • James Le

    James Le

  • bytearray

    bytearray

  • Medium Staff

    Medium Staff

See all (5)

Help

Status

About

Careers

Press

Blog

Privacy

Rules

Terms

Text to speech