Vinoth Chandar – Medium

Vinoth Chandar

Published in
bytearray

Doing range gets on cloud storage for fun and profit

The thing that stands between good and great cloud read performance

Nov 28, 2023

Doing range gets on cloud storage for fun and profit

Nov 28, 2023

Published in
bytearray

Corrections in data lakehouse table format comparisons

A live document to serve as a point of reference for corrections for inaccuracies for different comparative studies of Hudi, Delta Lake, or…

Apr 20, 2022

Corrections in data lakehouse table format comparisons

Apr 20, 2022

Published in
apache-hudi-blogs

Reliable ingestion from AWS S3 using Hudi

In this post we will talk about a new deltastreamer source which reliably and efficiently processes new data files as they arrive in AWS S3

Sep 2, 2021

Reliable ingestion from AWS S3 using Hudi

Sep 2, 2021

Published in
apache-hudi-blogs

Apache Hudi — The Streaming Data Lake Platform

This blog is a repost of the original blog here

Jul 27, 2021

Apache Hudi — The Streaming Data Lake Platform

Jul 27, 2021

Published in
apache-hudi-blogs

Streaming Responsibly Into the Data Lake

How Apache Hudi maintains optimum sized files

Mar 15, 2021

Streaming Responsibly Into the Data Lake

Mar 15, 2021

Published in
apache-hudi-blogs

Optimize Data Lake layout using Clustering in Apache Hudi

This blog is a repost of this Hudi blog on medium.

Jan 28, 2021

Optimize Data Lake layout using Clustering in Apache Hudi

Jan 28, 2021

Published in
apache-hudi-blogs

Employing the right indexes for fast updates, deletes in Apache Hudi

This blog is a repost of this Hudi blog on medium.

Dec 19, 2020

Employing the right indexes for fast updates, deletes in Apache Hudi

Dec 19, 2020

Published in
bytearray

Apache Hudi (Incubating) Support on Apache Zeppelin

Reposted translation of the original article : https://mp.weixin.qq.com/s/_mNwL5uXSDYyqtLDPx0iDA

Apr 27, 2020

Apache Hudi (Incubating) Support on Apache Zeppelin

Apr 27, 2020

Embrace the Data Lake Architecture

Often times, data engineers build data pipelines to extract data from external sources, transform them and enable other parts of the…

Jun 8, 2019

Jun 8, 2019

Published in
bytearray

Setting up Hadoop/YARN/Spark/Hive on Mac OSX

[Reposted from my blogger]

Sep 16, 2018

Sep 16, 2018

Vinoth Chandar

Vinoth Chandar

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech