module Mapred_job_config:Extract job configuration, and marshallingsig..end
type m_job_config
val extract_job_config : Netplex_types.config_file ->
(string * string) list ->
string list -> Mapred_def.mapred_job_config * m_job_configlet (jc, mjc) = extract_job_config cf args custom_params:
Extracts the job configuration from cf. The association list
args may contain overrides (leftmost value is taken).
Returns the configuration as object jc, and in a marshallable
representation mjc.
val mapred_job_config : m_job_config -> Mapred_def.mapred_job_configval marshal : m_job_config -> stringval unmarshal : string -> m_job_config
netplex {
...
mapredjob {
<name> = <value>;
...
}
}
The possible names are the method names of Mapred_def.mapred_job_config.
The values should have the right type.
Example:
netplex {
mapredjob {
name = "my_job";
input_dir = "/input";
output_dir = "/output";
work_dir = "/work";
log_dir = "/log";
bigblock_size = 65536;
map_tasks = 100;
merge_limit = 4;
split_limit = 4;
partitions = 20;
}
}
Some settings have default values:
name is set to an automatically generated namebigblock_size is 16Mmap_tasks is 0 (meaning a good value is computed at runtime)merge_limit and split_limit are 4