A Blog about Software Development

A Blog about Software Development

How to compile Hadoop from source

PublishedFebruary 18, 2016

•1 min read•View as Markdown

Hands-on technology leader with 10+ years building scalable, mission-critical systems at Goldman Sachs, Brevan Howard and fast-growing fintechs. Expert in cloud-native architectures, distributed data pipelines and high-throughput systems; experienced in migrating legacy platforms and designing AI-enabled services. Proven track record delivering reliable platforms that process millions of transactions daily.

Hadoop is a great (and complex) software framework with a lot of dependencies and configurations you need to make. One way to get up to speed with it is to download ready-made release binaries and just install them (You can find related links at http://hadoop.apache.org/releases.html).

But if you want to have the latest source code (and probably work on it), you will need to check them out from Apache Hadoop's git repository. Before you can run Hadoop this way, you will need to set-up your system. This setup will include:
1. Installing required OS packages (autoconf, cmake, libtool, …)
2. Installing JDK and Maven
3. Installing protocol buffers with the correct version
4. Installing Hadoop maven plugins
5. Compiling and installing Hadoop
6. Setting up password-less SSH and some environment variables

I have prepared a 'Vagrantfile' with instructions to provision the vagrant machine to do above steps. All required steps are explained in README file of the repository: https://github.com/mm-binary/hadoop-src-getting-started

<p>
</p>

Comments

Join the discussion

No comments yet. Be the first to comment.

More from this blog

Stop Treating AI Like a Coworker

It’s an Exoskeleton, and That Changes Everything Most people are thinking about AI the wrong way. They imagine it as a coworker:Something you assign tasks to… wait for… and hope it delivers. Sounds re

Mar 17, 20264 min read

My AI Adoption Journey: From Skeptic to Daily Power User

I didn’t wake up one day and decide, “AI will change everything.”My journey into AI was slower, messier, and honestly… a bit reluctant at first. If you’re a builder, engineer, or curious technologist,

Mar 2, 20265 min read

Meet NanoLang: The Tiny Programming Language Built for AI (and Curious Devs)

Imagine a programming language designed not just for humans but also for AI.Not adapted for AI. Not retrofitted. Built from scratch so AI can read and write it easily. That’s exactly what NanoLang is. It’s a tiny, experimental language created by vet...

Feb 16, 20265 min read

Supercharged PostgreSQL Tips: Less Boring, More Powerful

Most PostgreSQL optimization guides feel like laundry lists of settings and indexes. But real performance gains often come from clever ideas, not just the usual tricks. Let’s unpack a few of those ideas in ways that actually make sense for you. These...

Feb 10, 20265 min read

Pop-Ups Are Back, Baby, And Browsers Don't Care

Remember when we defeated pop-up ads? Yeah, they're back. And this time, nobody's fighting them. The Good Old Days (Of Terrible Ads) Back around 2000, the internet was a warzone. Visit any website and BAM!!! random windows would explode onto your scr...

Jan 26, 20262 min read

A

A Blog about Software Development

96 posts