clBuildProgram
Builds (compiles and links) a program executable from the program source or binary.
cl_int clBuildProgram ( | cl_program program, |
cl_uint num_devices, | |
const cl_device_id *device_list, | |
const char *options, | |
void (*pfn_notify)(cl_program, void *user_data), | |
void *user_data) |
Parameters
- program
The program object
- device_list
A pointer to a list of devices that are in
program
. Ifdevice_list
is NULL value, the program executable is built for all devices associated withprogram
for which a source or binary has been loaded. Ifdevice_list
is a non-NULL value, the program executable is built for devices specified in this list for which a source or binary has been loaded.- num_devices
The number of devices listed in
device_list
.- options
A pointer to a string that describes the build options to be used for building the program executable. The list of supported options is described in "Build Options" below.
- pfn_notify
A function pointer to a notification routine. The notification routine is a callback function that an application can register and which will be called when the program executable has been built (successfully or unsuccessfully). If
pfn_notify
is not NULL,clBuildProgram
does not need to wait for the build to complete and can return immediately. Ifpfn_notify
is NULL,clBuildProgram
does not return until the build has completed. This callback function may be called asynchronously by the OpenCL implementation. It is the application's responsibility to ensure that the callback function is thread-safe.- user_data
Passed as an argument when
pfn_notify
is called.user_data
can be NULL.
Notes
OpenCL allows program executables to be built using the source or the binary.
The build options are categorized as pre-processor options, options for math intrinsics, options that control optimization and miscellaneous options. This specification defines a standard set of options that must be supported by an OpenCL compiler when building program executables online or offline. These may be extended by a set of vendor- or platform-specific options.
Preprocessor Options
These options control the OpenCL preprocessor which is run on each program source before actual compilation. -D options are processed in the order they are given in the options argument toclBuildProgram
.- -D name
Predefine
name
as a macro, with definition 1.- -D name=definition
The contents of
definition
are tokenized and processed as if they appeared during translation phase three in a `#define' directive. In particular, the definition will be truncated by embedded newline characters.- -I dir
Add the directory
dir
to the list of directories to be searched for header files.
Math Intrinsics Options
These options control compiler behavior regarding floating-point arithmetic. These options trade off between speed and correctness.- -cl-single-precision-constant
Treat double precision floating-point constant as single precision constant.
- -cl-denorms-are-zero
This option controls how single precision and double precision denormalized numbers are handled. If specified as a build option, the single precision denormalized numbers may be flushed to zero and if the optional extension for double precision is supported, double precision denormalized numbers may also be flushed to zero. This is intended to be a performance hint and the OpenCL compiler can choose not to flush denorms to zero if the device supports single precision (or double precision) denormalized numbers.
This option is ignored for single precision numbers if the device does not support single precision denormalized numbers i.e. CL_FP_DENORM bit is not set in CL_DEVICE_SINGLE_FP_CONFIG.
This option is ignored for double precision numbers if the device does not support double precision or if it does support double precison but CL_FP_DENORM bit is not set in CL_DEVICE_DOUBLE_FP_CONFIG.
This flag only applies for scalar and vector single precision floating-point variables and computations on these floating-point variables inside a program. It does not apply to reading from or writing to image objects.
Optimization Options
These options control various sorts of optimizations. Turning on optimization flags makes the compiler attempt to improve the performance and/or code size at the expense of compilation time and possibly the ability to debug the program.- -cl-opt-disable
This option disables all optimizations. The default is optimizations are enabled.
- -cl-strict-aliasing
This option allows the compiler to assume the strictest aliasing rules.
The following options control compiler behavior regarding floating-point arithmetic. These options trade off between performance and correctness and must be specifically enabled. These options are not turned on by default since it can result in incorrect output for programs which depend on an exact implementation of IEEE 754 rules/specifications for math functions.
- -cl-mad-enable
Allow
a * b + c
to be replaced by amad
. Themad
computesa * b + c
with reduced accuracy. For example, some OpenCL devices implementmad
as truncate the result ofa * b
before adding it toc
.- -cl-no-signed-zeros
Allow optimizations for floating-point arithmetic that ignore the signedness of zero. IEEE 754 arithmetic specifies the behavior of distinct
+0.0
and-0.0
values, which then prohibits simplification of expressions such asx+0.0
or0.0*x
(even with -clfinite-math only). This option implies that the sign of a zero result isn't significant.- -cl-unsafe-math-optimizations
Allow optimizations for floating-point arithmetic that (a) assume that arguments and results are valid, (b) may violate IEEE 754 standard and (c) may violate the OpenCL numerical compliance requirements as defined in section 7.4 for single-precision floating-point, section 9.3.9 for double-precision floating-point, and edge case behavior in section 7.5. This option includes the -cl-no-signed-zeros and -cl-mad-enable options.
- -cl-finite-math-only
Allow optimizations for floating-point arithmetic that assume that arguments and results are not NaNs or ±∞. This option may violate the OpenCL numerical compliance requirements defined in in section 7.4 for single-precision floating-point, section 9.3.9 for double-precision floating-point, and edge case behavior in section 7.5.
- -cl-fast-relaxed-math
Sets the optimization options -cl-finite-math-only and -cl-unsafe-math-optimizations. This allows optimizations for floating-point arithmetic that may violate the IEEE 754 standard and the OpenCL numerical compliance requirements defined in the specification in section 7.4 for single-precision floating-point, section 9.3.9 for double-precision floating-point, and edge case behavior in section 7.5. This option causes the preprocessor macro
__FAST_RELAXED_MATH__
to be defined in the OpenCL program.
Options to Request or Suppress Warnings
Warnings are diagnostic messages that report constructions which are not inherently erroneous but which are risky or suggest there may have been an error. The following languageindependent options do not enable specific warnings but control the kinds of diagnostics produced by the OpenCL compiler.- -w
Inhibit all warning messages.
- -Werror
Make all warnings into errors.
Errors
clBuildProgram
returns CL_SUCCESS if the function is executed successfully. Otherwise, it returns one of the following errors:
- CL_INVALID_PROGRAM if
program
is not a valid program object. - CL_INVALID_VALUE if
device_list
is NULL andnum_devices
is greater than zero, or ifdevice_list
is not NULL andnum_devices
is zero. - CL_INVALID_VALUE if
pfn_notify
is NULL butuser_data
is not NULL. - CL_INVALID_DEVICE if OpenCL devices listed in
device_list
are not in the list of devices associated withprogram
. - CL_INVALID_BINARY if
program
is created with clCreateWithProgramWithBinary and devices listed indevice_list
do not have a valid program binary loaded. - CL_INVALID_BUILD_OPTIONS if the build options specified by
options
are invalid. - CL_INVALID_OPERATION if the build of a program executable for any of the devices listed in
device_list
by a previous call to clBuildProgram forprogram
has not completed. - CL_COMPILER_NOT_AVAILABLE if
program
is created with clCreateProgramWithSource and a compiler is not available i.e.CL_DEVICE_COMPILER_AVAILABLE specified in the table of OpenCL Device Queries for clGetDeviceInfo is set to CL_FALSE. - CL_BUILD_PROGRAM_FAILURE if there is a failure to build the program executable. This error will be returned if
clBuildProgram
does not return until the build has completed. - CL_INVALID_OPERATION if there are kernel objects attached to
program
. - CL_OUT_OF_HOST_MEMORY if there is a failure to allocate resources required by the OpenCL implementation on the host.
'Program Language > OpenCL' 카테고리의 다른 글
amd opencl __constant (0) | 2012.06.20 |
---|---|
standard functions in OpenCL (0) | 2012.06.04 |
OpenCL Address Spaces (0) | 2012.06.02 |
clFinish, clFlush, block (0) | 2012.05.26 |
clEnqueueCopyBufferRect vs clEnqueueWriteBufferRect (0) | 2012.05.22 |
댓글