In this repository I explained all installation steps of Hadoop Architecture in Windows.
-
Updated
May 6, 2024
In this repository I explained all installation steps of Hadoop Architecture in Windows.
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
I installed Hadoop on Virtual Machine and all Assignments are performed on Ubuntu OS. Refer to this repo for completion of the Hadoop Assignments. It is recommended that you have a stable internet connection while doing these things.
A basic introductory example of hadoops mapreduce libraries to load and analyse large datasets in this case a US patent dataset sourced from https://www.nber.org/research/data/us-patents
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
The repo contains the steps for setting up the single node cluster in Hadoop 3.2.1 in Ubuntu 20.04 LTS
An Ansible Role to Configure and setup Hive Data WareHouse on Client Node.
This repository contains a simple Hadoop-like (MapReduce) distributed computing platform implemented in Java. It is extended from a course project at UIUC awarded the best Java version implementation and it's open-sourced for reference.
The goal of this project is to identify the flood-prone areas with probabilities of flood in counties in a future date, using Spark MLLib.
EMR 5.25.0 cluster single node Hadoop docker image. With Amazon Linux, Hadoop 2.8.5 and Hive 2.3.5
WQD7008 Parallel and Distributed Computing Project
PageRank algorithm written in Java MapReduce framework
MapReduce in Cluster.
Titanic data analysis with Hadoop
Distributed Hadoop and Spark based framework for in-memory GIS queries
Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark
MapReduce Python Example
Twitter data analysis using hadoop (hdfs), flume, map-reduce and hive. Sentiment Analysis is also done using affin dictionary for tweets related to Indian election.
Add a description, image, and links to the hadoop-framework topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-framework topic, visit your repo's landing page and select "manage topics."