Big Data Project 2022 - Yet Another Map Reduce.

Pre-requisites and Assumptions

python is being used to run the code, for running with python3 python needs to be replaced with python3 in the WorkerNode.py file.
Commands to create and delete intermeddiate folders and files are written with respect to Windows OS, for any other OS changes need to be made accordingly in all the files.
A file needs to be written before it can be read.
All input files are present in the same directory as the code.
Before running the code, the directory needs to be cleaned of all the files generated by the code.
The MasterNode needs to be restarted before each operation by closing the terminal and opening a new one and running the command again.
The number of nodes mentioned for reading and writing have to be equal.

NOTE- Client terminal need not be restarted

For Execution

1. Clone the repository

Write Operation

2. Open Terminal in that directory
3. Start the MasterNode with the command
    python MasterNode.py
4. Open another terminal in the same directory
4. Start the Client with the name of file to be written and the number of worker nodes 
    python Client.py filename no_of_worker
For example
    python Client.py test.txt 2
Where test.txt is the file to be written and 2 is the number of worker nodes
5. Choose 1 in the menu to write the file
6. Ensure that the terminal of MasterNode is closed before starting a new operation

Read Operation

7. Open Terminal in that directory
8. Start the MasterNode with the command
    python MasterNode.py
9. Open another terminal in the same directory
10. Start the Client with the name of file to be read and the number of worker nodes 
    python Client.py filename no_of_worker
For example
    python Client.py test.txt 2
Where test.txt is the file to be read and 2 is the number of worker nodes
11. Choose 2 in the menu to read the file
12. Output will be displayed on the terminal
12. Ensure that the terminal of MasterNode is closed before starting a new operation

Map Reduce Operation

13. Open Terminal in that directory
14. Start the MasterNode with the command
    python MasterNode.py
15. Open another terminal in the same directory
16. Start the Client with the name of input file, mapper, reducer and the number of worker nodes 
    python Client.py filename mapper.py reducer.py no_of_worker
For example
    python Client.py test.txt Mapper.py Reducer.py 6
Where, 
test.txt is the input file 
Mapper.py is the mapper 
Reducer.py is the reducer
6 is the number of worker nodes
17. Choose 3 in the menu to perform Map Reduce
18. Output will be displayed on the terminal
19. Ensure that the terminal of MasterNode is closed before starting a new operation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Big Data Project 2022 - Yet Another Map Reduce.

Pre-requisites and Assumptions

For Execution

Write Operation

Read Operation

Map Reduce Operation

About

Releases

Packages

Contributors 4

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
Client.py		Client.py
Mapper.py		Mapper.py
MasterNode.py		MasterNode.py
README.md		README.md
Reducer.py		Reducer.py
WorkerNode.py		WorkerNode.py
helloji.txt		helloji.txt
test.txt		test.txt

UtkarshBagaria/Yet-Another-Map-Reduce

Folders and files

Latest commit

History

Repository files navigation

Big Data Project 2022 - Yet Another Map Reduce.

Pre-requisites and Assumptions

For Execution

Write Operation

Read Operation

Map Reduce Operation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages