Before we get started – just a quick note: this will only work for as long as your hob haven’t been submitted to a cluster or as long as your jobs run locally.
This is basically just the right thing to do if you want to debug configuration parameters or other set up relevant processes. I used this to debug a CLI call to a scalding job, for instance.
1) The first thing you need to do is to add the remote debugging facility to hadoop:
export HADOOP_OPTS="$HADOOP_OPTS -Xdebug -Xrunjdwp:transport=dt_socket,server=y,address=8999"
This need to be added to your conf/hadoop-env.sh or exported to your env variables.
2) Now eclipse:
Choose Run -> DebugConfigurations -> Remote Java Application
and add port 8999 to your connection settings. That’s what it should look like: