Build stage dependency for a job
Build stage dependency for a job
ID of the job
Root stage of the job
Close log file, and clean the stage relationship in stageIDToJobID
Close log file, and clean the stage relationship in stageIDToJobID
ID of the job
Create a folder for log files, the folder's name is the creation time of jobLogger
Create a folder for log files, the folder's name is the creation time of jobLogger
Create a log file for one job
Create a log file for one job
ID of the job
Get RDD's name
Get RDD's name
Input RDD
String of RDD's name
Generate indents and convert to String
Generate indents and convert to String
Number of indents
string of indents
Write info into log file
Write info into log file
ID of the job
Info to be recorded
Controls whether to record time stamp before the info, default is true
The base directory for the log files.
When job ends, recording job completion status and close log file
When job ends, recording job completion status and close log file
Job end event
When job starts, record job property and stage graph
When job starts, record job property and stage graph
Job start event
When stage is completed, record stage completion status
When stage is completed, record stage completion status
Stage completed event
When stage is submitted, record stage submit info
When stage is submitted, record stage submit info
Stage submitted event
When task ends, record task completion status and metrics
When task ends, record task completion status and metrics
Task end event
Called when a task begins remotely fetching its result (will not be called for tasks that do not need to fetch the result remotely).
Called when a task begins remotely fetching its result (will not be called for tasks that do not need to fetch the result remotely).
Called when a task starts
Called when a task starts
Record job properties into job log file
Record job properties into job log file
ID of the job
Properties of the job
Record RDD dependency graph in a stage
Record RDD dependency graph in a stage
Job ID of the stage
Root RDD of the stage
Indent number before info
Record stage dependency and RDD dependency for a stage
Record stage dependency and RDD dependency for a stage
Job ID of the stage
Record stage dependency graph of a job
Record stage dependency graph of a job
Job ID of the stage
Root stage of the job
Indent number before info, default is 0
Record task metrics into job log files, including execution info and shuffle metrics
Record task metrics into job log files, including execution info and shuffle metrics
Stage ID of the task
Status info of the task
Task description info
Task running metrics
Write info into log file
Write info into log file
ID of the stage
Info to be recorded
Controls whether to record time stamp before the info, default is true
A logger class to record runtime information for jobs in Spark. This class outputs one log file for each Spark job, containing RDD graph, tasks start/stop, shuffle information. JobLogger is a subclass of SparkListener, use addSparkListener to add JobLogger to a SparkContext after the SparkContext is created. Note that each JobLogger only works for one SparkContext