1USING THE IJG JPEG LIBRARY 2 3This file was part of the Independent JPEG Group's software: 4Copyright (C) 1994-2011, Thomas G. Lane, Guido Vollbeding. 5libjpeg-turbo Modifications: 6Copyright (C) 2010, 2014-2016, D. R. Commander. 7Copyright (C) 2015, Google, Inc. 8For conditions of distribution and use, see the accompanying README file. 9 10 11This file describes how to use the IJG JPEG library within an application 12program. Read it if you want to write a program that uses the library. 13 14The file example.c provides heavily commented skeleton code for calling the 15JPEG library. Also see jpeglib.h (the include file to be used by application 16programs) for full details about data structures and function parameter lists. 17The library source code, of course, is the ultimate reference. 18 19Note that there have been *major* changes from the application interface 20presented by IJG version 4 and earlier versions. The old design had several 21inherent limitations, and it had accumulated a lot of cruft as we added 22features while trying to minimize application-interface changes. We have 23sacrificed backward compatibility in the version 5 rewrite, but we think the 24improvements justify this. 25 26 27TABLE OF CONTENTS 28----------------- 29 30Overview: 31 Functions provided by the library 32 Outline of typical usage 33Basic library usage: 34 Data formats 35 Compression details 36 Decompression details 37 Mechanics of usage: include files, linking, etc 38Advanced features: 39 Compression parameter selection 40 Decompression parameter selection 41 Special color spaces 42 Error handling 43 Compressed data handling (source and destination managers) 44 I/O suspension 45 Progressive JPEG support 46 Buffered-image mode 47 Abbreviated datastreams and multiple images 48 Special markers 49 Raw (downsampled) image data 50 Really raw data: DCT coefficients 51 Progress monitoring 52 Memory management 53 Memory usage 54 Library compile-time options 55 Portability considerations 56 57You should read at least the overview and basic usage sections before trying 58to program with the library. The sections on advanced features can be read 59if and when you need them. 60 61 62OVERVIEW 63======== 64 65Functions provided by the library 66--------------------------------- 67 68The IJG JPEG library provides C code to read and write JPEG-compressed image 69files. The surrounding application program receives or supplies image data a 70scanline at a time, using a straightforward uncompressed image format. All 71details of color conversion and other preprocessing/postprocessing can be 72handled by the library. 73 74The library includes a substantial amount of code that is not covered by the 75JPEG standard but is necessary for typical applications of JPEG. These 76functions preprocess the image before JPEG compression or postprocess it after 77decompression. They include colorspace conversion, downsampling/upsampling, 78and color quantization. The application indirectly selects use of this code 79by specifying the format in which it wishes to supply or receive image data. 80For example, if colormapped output is requested, then the decompression 81library automatically invokes color quantization. 82 83A wide range of quality vs. speed tradeoffs are possible in JPEG processing, 84and even more so in decompression postprocessing. The decompression library 85provides multiple implementations that cover most of the useful tradeoffs, 86ranging from very-high-quality down to fast-preview operation. On the 87compression side we have generally not provided low-quality choices, since 88compression is normally less time-critical. It should be understood that the 89low-quality modes may not meet the JPEG standard's accuracy requirements; 90nonetheless, they are useful for viewers. 91 92A word about functions *not* provided by the library. We handle a subset of 93the ISO JPEG standard; most baseline, extended-sequential, and progressive 94JPEG processes are supported. (Our subset includes all features now in common 95use.) Unsupported ISO options include: 96 * Hierarchical storage 97 * Lossless JPEG 98 * DNL marker 99 * Nonintegral subsampling ratios 100We support both 8- and 12-bit data precision, but this is a compile-time 101choice rather than a run-time choice; hence it is difficult to use both 102precisions in a single application. 103 104By itself, the library handles only interchange JPEG datastreams --- in 105particular the widely used JFIF file format. The library can be used by 106surrounding code to process interchange or abbreviated JPEG datastreams that 107are embedded in more complex file formats. (For example, this library is 108used by the free LIBTIFF library to support JPEG compression in TIFF.) 109 110 111Outline of typical usage 112------------------------ 113 114The rough outline of a JPEG compression operation is: 115 116 Allocate and initialize a JPEG compression object 117 Specify the destination for the compressed data (eg, a file) 118 Set parameters for compression, including image size & colorspace 119 jpeg_start_compress(...); 120 while (scan lines remain to be written) 121 jpeg_write_scanlines(...); 122 jpeg_finish_compress(...); 123 Release the JPEG compression object 124 125A JPEG compression object holds parameters and working state for the JPEG 126library. We make creation/destruction of the object separate from starting 127or finishing compression of an image; the same object can be re-used for a 128series of image compression operations. This makes it easy to re-use the 129same parameter settings for a sequence of images. Re-use of a JPEG object 130also has important implications for processing abbreviated JPEG datastreams, 131as discussed later. 132 133The image data to be compressed is supplied to jpeg_write_scanlines() from 134in-memory buffers. If the application is doing file-to-file compression, 135reading image data from the source file is the application's responsibility. 136The library emits compressed data by calling a "data destination manager", 137which typically will write the data into a file; but the application can 138provide its own destination manager to do something else. 139 140Similarly, the rough outline of a JPEG decompression operation is: 141 142 Allocate and initialize a JPEG decompression object 143 Specify the source of the compressed data (eg, a file) 144 Call jpeg_read_header() to obtain image info 145 Set parameters for decompression 146 jpeg_start_decompress(...); 147 while (scan lines remain to be read) 148 jpeg_read_scanlines(...); 149 jpeg_finish_decompress(...); 150 Release the JPEG decompression object 151 152This is comparable to the compression outline except that reading the 153datastream header is a separate step. This is helpful because information 154about the image's size, colorspace, etc is available when the application 155selects decompression parameters. For example, the application can choose an 156output scaling ratio that will fit the image into the available screen size. 157 158The decompression library obtains compressed data by calling a data source 159manager, which typically will read the data from a file; but other behaviors 160can be obtained with a custom source manager. Decompressed data is delivered 161into in-memory buffers passed to jpeg_read_scanlines(). 162 163It is possible to abort an incomplete compression or decompression operation 164by calling jpeg_abort(); or, if you do not need to retain the JPEG object, 165simply release it by calling jpeg_destroy(). 166 167JPEG compression and decompression objects are two separate struct types. 168However, they share some common fields, and certain routines such as 169jpeg_destroy() can work on either type of object. 170 171The JPEG library has no static variables: all state is in the compression 172or decompression object. Therefore it is possible to process multiple 173compression and decompression operations concurrently, using multiple JPEG 174objects. 175 176Both compression and decompression can be done in an incremental memory-to- 177memory fashion, if suitable source/destination managers are used. See the 178section on "I/O suspension" for more details. 179 180 181BASIC LIBRARY USAGE 182=================== 183 184Data formats 185------------ 186 187Before diving into procedural details, it is helpful to understand the 188image data format that the JPEG library expects or returns. 189 190The standard input image format is a rectangular array of pixels, with each 191pixel having the same number of "component" or "sample" values (color 192channels). You must specify how many components there are and the colorspace 193interpretation of the components. Most applications will use RGB data 194(three components per pixel) or grayscale data (one component per pixel). 195PLEASE NOTE THAT RGB DATA IS THREE SAMPLES PER PIXEL, GRAYSCALE ONLY ONE. 196A remarkable number of people manage to miss this, only to find that their 197programs don't work with grayscale JPEG files. 198 199There is no provision for colormapped input. JPEG files are always full-color 200or full grayscale (or sometimes another colorspace such as CMYK). You can 201feed in a colormapped image by expanding it to full-color format. However 202JPEG often doesn't work very well with source data that has been colormapped, 203because of dithering noise. This is discussed in more detail in the JPEG FAQ 204and the other references mentioned in the README file. 205 206Pixels are stored by scanlines, with each scanline running from left to 207right. The component values for each pixel are adjacent in the row; for 208example, R,G,B,R,G,B,R,G,B,... for 24-bit RGB color. Each scanline is an 209array of data type JSAMPLE --- which is typically "unsigned char", unless 210you've changed jmorecfg.h. (You can also change the RGB pixel layout, say 211to B,G,R order, by modifying jmorecfg.h. But see the restrictions listed in 212that file before doing so.) 213 214A 2-D array of pixels is formed by making a list of pointers to the starts of 215scanlines; so the scanlines need not be physically adjacent in memory. Even 216if you process just one scanline at a time, you must make a one-element 217pointer array to conform to this structure. Pointers to JSAMPLE rows are of 218type JSAMPROW, and the pointer to the pointer array is of type JSAMPARRAY. 219 220The library accepts or supplies one or more complete scanlines per call. 221It is not possible to process part of a row at a time. Scanlines are always 222processed top-to-bottom. You can process an entire image in one call if you 223have it all in memory, but usually it's simplest to process one scanline at 224a time. 225 226For best results, source data values should have the precision specified by 227BITS_IN_JSAMPLE (normally 8 bits). For instance, if you choose to compress 228data that's only 6 bits/channel, you should left-justify each value in a 229byte before passing it to the compressor. If you need to compress data 230that has more than 8 bits/channel, compile with BITS_IN_JSAMPLE = 12. 231(See "Library compile-time options", later.) 232 233 234The data format returned by the decompressor is the same in all details, 235except that colormapped output is supported. (Again, a JPEG file is never 236colormapped. But you can ask the decompressor to perform on-the-fly color 237quantization to deliver colormapped output.) If you request colormapped 238output then the returned data array contains a single JSAMPLE per pixel; 239its value is an index into a color map. The color map is represented as 240a 2-D JSAMPARRAY in which each row holds the values of one color component, 241that is, colormap[i][j] is the value of the i'th color component for pixel 242value (map index) j. Note that since the colormap indexes are stored in 243JSAMPLEs, the maximum number of colors is limited by the size of JSAMPLE 244(ie, at most 256 colors for an 8-bit JPEG library). 245 246 247Compression details 248------------------- 249 250Here we revisit the JPEG compression outline given in the overview. 251 2521. Allocate and initialize a JPEG compression object. 253 254A JPEG compression object is a "struct jpeg_compress_struct". (It also has 255a bunch of subsidiary structures which are allocated via malloc(), but the 256application doesn't control those directly.) This struct can be just a local 257variable in the calling routine, if a single routine is going to execute the 258whole JPEG compression sequence. Otherwise it can be static or allocated 259from malloc(). 260 261You will also need a structure representing a JPEG error handler. The part 262of this that the library cares about is a "struct jpeg_error_mgr". If you 263are providing your own error handler, you'll typically want to embed the 264jpeg_error_mgr struct in a larger structure; this is discussed later under 265"Error handling". For now we'll assume you are just using the default error 266handler. The default error handler will print JPEG error/warning messages 267on stderr, and it will call exit() if a fatal error occurs. 268 269You must initialize the error handler structure, store a pointer to it into 270the JPEG object's "err" field, and then call jpeg_create_compress() to 271initialize the rest of the JPEG object. 272 273Typical code for this step, if you are using the default error handler, is 274 275 struct jpeg_compress_struct cinfo; 276 struct jpeg_error_mgr jerr; 277 ... 278 cinfo.err = jpeg_std_error(&jerr); 279 jpeg_create_compress(&cinfo); 280 281jpeg_create_compress allocates a small amount of memory, so it could fail 282if you are out of memory. In that case it will exit via the error handler; 283that's why the error handler must be initialized first. 284 285 2862. Specify the destination for the compressed data (eg, a file). 287 288As previously mentioned, the JPEG library delivers compressed data to a 289"data destination" module. The library includes one data destination 290module which knows how to write to a stdio stream. You can use your own 291destination module if you want to do something else, as discussed later. 292 293If you use the standard destination module, you must open the target stdio 294stream beforehand. Typical code for this step looks like: 295 296 FILE * outfile; 297 ... 298 if ((outfile = fopen(filename, "wb")) == NULL) { 299 fprintf(stderr, "can't open %s\n", filename); 300 exit(1); 301 } 302 jpeg_stdio_dest(&cinfo, outfile); 303 304where the last line invokes the standard destination module. 305 306WARNING: it is critical that the binary compressed data be delivered to the 307output file unchanged. On non-Unix systems the stdio library may perform 308newline translation or otherwise corrupt binary data. To suppress this 309behavior, you may need to use a "b" option to fopen (as shown above), or use 310setmode() or another routine to put the stdio stream in binary mode. See 311cjpeg.c and djpeg.c for code that has been found to work on many systems. 312 313You can select the data destination after setting other parameters (step 3), 314if that's more convenient. You may not change the destination between 315calling jpeg_start_compress() and jpeg_finish_compress(). 316 317 3183. Set parameters for compression, including image size & colorspace. 319 320You must supply information about the source image by setting the following 321fields in the JPEG object (cinfo structure): 322 323 image_width Width of image, in pixels 324 image_height Height of image, in pixels 325 input_components Number of color channels (samples per pixel) 326 in_color_space Color space of source image 327 328The image dimensions are, hopefully, obvious. JPEG supports image dimensions 329of 1 to 64K pixels in either direction. The input color space is typically 330RGB or grayscale, and input_components is 3 or 1 accordingly. (See "Special 331color spaces", later, for more info.) The in_color_space field must be 332assigned one of the J_COLOR_SPACE enum constants, typically JCS_RGB or 333JCS_GRAYSCALE. 334 335JPEG has a large number of compression parameters that determine how the 336image is encoded. Most applications don't need or want to know about all 337these parameters. You can set all the parameters to reasonable defaults by 338calling jpeg_set_defaults(); then, if there are particular values you want 339to change, you can do so after that. The "Compression parameter selection" 340section tells about all the parameters. 341 342You must set in_color_space correctly before calling jpeg_set_defaults(), 343because the defaults depend on the source image colorspace. However the 344other three source image parameters need not be valid until you call 345jpeg_start_compress(). There's no harm in calling jpeg_set_defaults() more 346than once, if that happens to be convenient. 347 348Typical code for a 24-bit RGB source image is 349 350 cinfo.image_width = Width; /* image width and height, in pixels */ 351 cinfo.image_height = Height; 352 cinfo.input_components = 3; /* # of color components per pixel */ 353 cinfo.in_color_space = JCS_RGB; /* colorspace of input image */ 354 355 jpeg_set_defaults(&cinfo); 356 /* Make optional parameter settings here */ 357 358 3594. jpeg_start_compress(...); 360 361After you have established the data destination and set all the necessary 362source image info and other parameters, call jpeg_start_compress() to begin 363a compression cycle. This will initialize internal state, allocate working 364storage, and emit the first few bytes of the JPEG datastream header. 365 366Typical code: 367 368 jpeg_start_compress(&cinfo, TRUE); 369 370The "TRUE" parameter ensures that a complete JPEG interchange datastream 371will be written. This is appropriate in most cases. If you think you might 372want to use an abbreviated datastream, read the section on abbreviated 373datastreams, below. 374 375Once you have called jpeg_start_compress(), you may not alter any JPEG 376parameters or other fields of the JPEG object until you have completed 377the compression cycle. 378 379 3805. while (scan lines remain to be written) 381 jpeg_write_scanlines(...); 382 383Now write all the required image data by calling jpeg_write_scanlines() 384one or more times. You can pass one or more scanlines in each call, up 385to the total image height. In most applications it is convenient to pass 386just one or a few scanlines at a time. The expected format for the passed 387data is discussed under "Data formats", above. 388 389Image data should be written in top-to-bottom scanline order. The JPEG spec 390contains some weasel wording about how top and bottom are application-defined 391terms (a curious interpretation of the English language...) but if you want 392your files to be compatible with everyone else's, you WILL use top-to-bottom 393order. If the source data must be read in bottom-to-top order, you can use 394the JPEG library's virtual array mechanism to invert the data efficiently. 395Examples of this can be found in the sample application cjpeg. 396 397The library maintains a count of the number of scanlines written so far 398in the next_scanline field of the JPEG object. Usually you can just use 399this variable as the loop counter, so that the loop test looks like 400"while (cinfo.next_scanline < cinfo.image_height)". 401 402Code for this step depends heavily on the way that you store the source data. 403example.c shows the following code for the case of a full-size 2-D source 404array containing 3-byte RGB pixels: 405 406 JSAMPROW row_pointer[1]; /* pointer to a single row */ 407 int row_stride; /* physical row width in buffer */ 408 409 row_stride = image_width * 3; /* JSAMPLEs per row in image_buffer */ 410 411 while (cinfo.next_scanline < cinfo.image_height) { 412 row_pointer[0] = & image_buffer[cinfo.next_scanline * row_stride]; 413 jpeg_write_scanlines(&cinfo, row_pointer, 1); 414 } 415 416jpeg_write_scanlines() returns the number of scanlines actually written. 417This will normally be equal to the number passed in, so you can usually 418ignore the return value. It is different in just two cases: 419 * If you try to write more scanlines than the declared image height, 420 the additional scanlines are ignored. 421 * If you use a suspending data destination manager, output buffer overrun 422 will cause the compressor to return before accepting all the passed lines. 423 This feature is discussed under "I/O suspension", below. The normal 424 stdio destination manager will NOT cause this to happen. 425In any case, the return value is the same as the change in the value of 426next_scanline. 427 428 4296. jpeg_finish_compress(...); 430 431After all the image data has been written, call jpeg_finish_compress() to 432complete the compression cycle. This step is ESSENTIAL to ensure that the 433last bufferload of data is written to the data destination. 434jpeg_finish_compress() also releases working memory associated with the JPEG 435object. 436 437Typical code: 438 439 jpeg_finish_compress(&cinfo); 440 441If using the stdio destination manager, don't forget to close the output 442stdio stream (if necessary) afterwards. 443 444If you have requested a multi-pass operating mode, such as Huffman code 445optimization, jpeg_finish_compress() will perform the additional passes using 446data buffered by the first pass. In this case jpeg_finish_compress() may take 447quite a while to complete. With the default compression parameters, this will 448not happen. 449 450It is an error to call jpeg_finish_compress() before writing the necessary 451total number of scanlines. If you wish to abort compression, call 452jpeg_abort() as discussed below. 453 454After completing a compression cycle, you may dispose of the JPEG object 455as discussed next, or you may use it to compress another image. In that case 456return to step 2, 3, or 4 as appropriate. If you do not change the 457destination manager, the new datastream will be written to the same target. 458If you do not change any JPEG parameters, the new datastream will be written 459with the same parameters as before. Note that you can change the input image 460dimensions freely between cycles, but if you change the input colorspace, you 461should call jpeg_set_defaults() to adjust for the new colorspace; and then 462you'll need to repeat all of step 3. 463 464 4657. Release the JPEG compression object. 466 467When you are done with a JPEG compression object, destroy it by calling 468jpeg_destroy_compress(). This will free all subsidiary memory (regardless of 469the previous state of the object). Or you can call jpeg_destroy(), which 470works for either compression or decompression objects --- this may be more 471convenient if you are sharing code between compression and decompression 472cases. (Actually, these routines are equivalent except for the declared type 473of the passed pointer. To avoid gripes from ANSI C compilers, jpeg_destroy() 474should be passed a j_common_ptr.) 475 476If you allocated the jpeg_compress_struct structure from malloc(), freeing 477it is your responsibility --- jpeg_destroy() won't. Ditto for the error 478handler structure. 479 480Typical code: 481 482 jpeg_destroy_compress(&cinfo); 483 484 4858. Aborting. 486 487If you decide to abort a compression cycle before finishing, you can clean up 488in either of two ways: 489 490* If you don't need the JPEG object any more, just call 491 jpeg_destroy_compress() or jpeg_destroy() to release memory. This is 492 legitimate at any point after calling jpeg_create_compress() --- in fact, 493 it's safe even if jpeg_create_compress() fails. 494 495* If you want to re-use the JPEG object, call jpeg_abort_compress(), or call 496 jpeg_abort() which works on both compression and decompression objects. 497 This will return the object to an idle state, releasing any working memory. 498 jpeg_abort() is allowed at any time after successful object creation. 499 500Note that cleaning up the data destination, if required, is your 501responsibility; neither of these routines will call term_destination(). 502(See "Compressed data handling", below, for more about that.) 503 504jpeg_destroy() and jpeg_abort() are the only safe calls to make on a JPEG 505object that has reported an error by calling error_exit (see "Error handling" 506for more info). The internal state of such an object is likely to be out of 507whack. Either of these two routines will return the object to a known state. 508 509 510Decompression details 511--------------------- 512 513Here we revisit the JPEG decompression outline given in the overview. 514 5151. Allocate and initialize a JPEG decompression object. 516 517This is just like initialization for compression, as discussed above, 518except that the object is a "struct jpeg_decompress_struct" and you 519call jpeg_create_decompress(). Error handling is exactly the same. 520 521Typical code: 522 523 struct jpeg_decompress_struct cinfo; 524 struct jpeg_error_mgr jerr; 525 ... 526 cinfo.err = jpeg_std_error(&jerr); 527 jpeg_create_decompress(&cinfo); 528 529(Both here and in the IJG code, we usually use variable name "cinfo" for 530both compression and decompression objects.) 531 532 5332. Specify the source of the compressed data (eg, a file). 534 535As previously mentioned, the JPEG library reads compressed data from a "data 536source" module. The library includes one data source module which knows how 537to read from a stdio stream. You can use your own source module if you want 538to do something else, as discussed later. 539 540If you use the standard source module, you must open the source stdio stream 541beforehand. Typical code for this step looks like: 542 543 FILE * infile; 544 ... 545 if ((infile = fopen(filename, "rb")) == NULL) { 546 fprintf(stderr, "can't open %s\n", filename); 547 exit(1); 548 } 549 jpeg_stdio_src(&cinfo, infile); 550 551where the last line invokes the standard source module. 552 553WARNING: it is critical that the binary compressed data be read unchanged. 554On non-Unix systems the stdio library may perform newline translation or 555otherwise corrupt binary data. To suppress this behavior, you may need to use 556a "b" option to fopen (as shown above), or use setmode() or another routine to 557put the stdio stream in binary mode. See cjpeg.c and djpeg.c for code that 558has been found to work on many systems. 559 560You may not change the data source between calling jpeg_read_header() and 561jpeg_finish_decompress(). If you wish to read a series of JPEG images from 562a single source file, you should repeat the jpeg_read_header() to 563jpeg_finish_decompress() sequence without reinitializing either the JPEG 564object or the data source module; this prevents buffered input data from 565being discarded. 566 567 5683. Call jpeg_read_header() to obtain image info. 569 570Typical code for this step is just 571 572 jpeg_read_header(&cinfo, TRUE); 573 574This will read the source datastream header markers, up to the beginning 575of the compressed data proper. On return, the image dimensions and other 576info have been stored in the JPEG object. The application may wish to 577consult this information before selecting decompression parameters. 578 579More complex code is necessary if 580 * A suspending data source is used --- in that case jpeg_read_header() 581 may return before it has read all the header data. See "I/O suspension", 582 below. The normal stdio source manager will NOT cause this to happen. 583 * Abbreviated JPEG files are to be processed --- see the section on 584 abbreviated datastreams. Standard applications that deal only in 585 interchange JPEG files need not be concerned with this case either. 586 587It is permissible to stop at this point if you just wanted to find out the 588image dimensions and other header info for a JPEG file. In that case, 589call jpeg_destroy() when you are done with the JPEG object, or call 590jpeg_abort() to return it to an idle state before selecting a new data 591source and reading another header. 592 593 5944. Set parameters for decompression. 595 596jpeg_read_header() sets appropriate default decompression parameters based on 597the properties of the image (in particular, its colorspace). However, you 598may well want to alter these defaults before beginning the decompression. 599For example, the default is to produce full color output from a color file. 600If you want colormapped output you must ask for it. Other options allow the 601returned image to be scaled and allow various speed/quality tradeoffs to be 602selected. "Decompression parameter selection", below, gives details. 603 604If the defaults are appropriate, nothing need be done at this step. 605 606Note that all default values are set by each call to jpeg_read_header(). 607If you reuse a decompression object, you cannot expect your parameter 608settings to be preserved across cycles, as you can for compression. 609You must set desired parameter values each time. 610 611 6125. jpeg_start_decompress(...); 613 614Once the parameter values are satisfactory, call jpeg_start_decompress() to 615begin decompression. This will initialize internal state, allocate working 616memory, and prepare for returning data. 617 618Typical code is just 619 620 jpeg_start_decompress(&cinfo); 621 622If you have requested a multi-pass operating mode, such as 2-pass color 623quantization, jpeg_start_decompress() will do everything needed before data 624output can begin. In this case jpeg_start_decompress() may take quite a while 625to complete. With a single-scan (non progressive) JPEG file and default 626decompression parameters, this will not happen; jpeg_start_decompress() will 627return quickly. 628 629After this call, the final output image dimensions, including any requested 630scaling, are available in the JPEG object; so is the selected colormap, if 631colormapped output has been requested. Useful fields include 632 633 output_width image width and height, as scaled 634 output_height 635 out_color_components # of color components in out_color_space 636 output_components # of color components returned per pixel 637 colormap the selected colormap, if any 638 actual_number_of_colors number of entries in colormap 639 640output_components is 1 (a colormap index) when quantizing colors; otherwise it 641equals out_color_components. It is the number of JSAMPLE values that will be 642emitted per pixel in the output arrays. 643 644Typically you will need to allocate data buffers to hold the incoming image. 645You will need output_width * output_components JSAMPLEs per scanline in your 646output buffer, and a total of output_height scanlines will be returned. 647 648Note: if you are using the JPEG library's internal memory manager to allocate 649data buffers (as djpeg does), then the manager's protocol requires that you 650request large buffers *before* calling jpeg_start_decompress(). This is a 651little tricky since the output_XXX fields are not normally valid then. You 652can make them valid by calling jpeg_calc_output_dimensions() after setting the 653relevant parameters (scaling, output color space, and quantization flag). 654 655 6566. while (scan lines remain to be read) 657 jpeg_read_scanlines(...); 658 659Now you can read the decompressed image data by calling jpeg_read_scanlines() 660one or more times. At each call, you pass in the maximum number of scanlines 661to be read (ie, the height of your working buffer); jpeg_read_scanlines() 662will return up to that many lines. The return value is the number of lines 663actually read. The format of the returned data is discussed under "Data 664formats", above. Don't forget that grayscale and color JPEGs will return 665different data formats! 666 667Image data is returned in top-to-bottom scanline order. If you must write 668out the image in bottom-to-top order, you can use the JPEG library's virtual 669array mechanism to invert the data efficiently. Examples of this can be 670found in the sample application djpeg. 671 672The library maintains a count of the number of scanlines returned so far 673in the output_scanline field of the JPEG object. Usually you can just use 674this variable as the loop counter, so that the loop test looks like 675"while (cinfo.output_scanline < cinfo.output_height)". (Note that the test 676should NOT be against image_height, unless you never use scaling. The 677image_height field is the height of the original unscaled image.) 678The return value always equals the change in the value of output_scanline. 679 680If you don't use a suspending data source, it is safe to assume that 681jpeg_read_scanlines() reads at least one scanline per call, until the 682bottom of the image has been reached. 683 684If you use a buffer larger than one scanline, it is NOT safe to assume that 685jpeg_read_scanlines() fills it. (The current implementation returns only a 686few scanlines per call, no matter how large a buffer you pass.) So you must 687always provide a loop that calls jpeg_read_scanlines() repeatedly until the 688whole image has been read. 689 690 6917. jpeg_finish_decompress(...); 692 693After all the image data has been read, call jpeg_finish_decompress() to 694complete the decompression cycle. This causes working memory associated 695with the JPEG object to be released. 696 697Typical code: 698 699 jpeg_finish_decompress(&cinfo); 700 701If using the stdio source manager, don't forget to close the source stdio 702stream if necessary. 703 704It is an error to call jpeg_finish_decompress() before reading the correct 705total number of scanlines. If you wish to abort decompression, call 706jpeg_abort() as discussed below. 707 708After completing a decompression cycle, you may dispose of the JPEG object as 709discussed next, or you may use it to decompress another image. In that case 710return to step 2 or 3 as appropriate. If you do not change the source 711manager, the next image will be read from the same source. 712 713 7148. Release the JPEG decompression object. 715 716When you are done with a JPEG decompression object, destroy it by calling 717jpeg_destroy_decompress() or jpeg_destroy(). The previous discussion of 718destroying compression objects applies here too. 719 720Typical code: 721 722 jpeg_destroy_decompress(&cinfo); 723 724 7259. Aborting. 726 727You can abort a decompression cycle by calling jpeg_destroy_decompress() or 728jpeg_destroy() if you don't need the JPEG object any more, or 729jpeg_abort_decompress() or jpeg_abort() if you want to reuse the object. 730The previous discussion of aborting compression cycles applies here too. 731 732 733Partial image decompression 734--------------------------- 735 736Partial image decompression is convenient for performance-critical applications 737that wish to view only a portion of a large JPEG image without decompressing 738the whole thing. It it also useful in memory-constrained environments (such as 739on mobile devices.) This library provides the following functions to support 740partial image decompression: 741 7421. Skipping rows when decompressing 743 744 jpeg_skip_scanlines(j_decompress_ptr cinfo, JDIMENSION num_lines); 745 746This function provides application programmers with the ability to skip over 747multiple rows in the JPEG image. 748 749Suspending data sources are not supported by this function. Calling 750jpeg_skip_scanlines() with a suspending data source will result in undefined 751behavior. 752 753jpeg_skip_scanlines() will not allow skipping past the bottom of the image. If 754the value of num_lines is large enough to skip past the bottom of the image, 755then the function will skip to the end of the image instead. 756 757If the value of num_lines is valid, then jpeg_skip_scanlines() will always 758skip all of the input rows requested. There is no need to inspect the return 759value of the function in that case. 760 761Best results will be achieved by calling jpeg_skip_scanlines() for large chunks 762of rows. The function should be viewed as a way to quickly jump to a 763particular vertical offset in the JPEG image in order to decode a subset of the 764image. Used in this manner, it will provide significant performance 765improvements. 766 767Calling jpeg_skip_scanlines() for small values of num_lines has several 768potential drawbacks: 769 1) JPEG decompression occurs in blocks, so if jpeg_skip_scanlines() is 770 called from the middle of a decompression block, then it is likely that 771 much of the decompression work has already been done for the first 772 couple of rows that need to be skipped. 773 2) When this function returns, it must leave the decompressor in a state 774 such that it is ready to read the next line. This may involve 775 decompressing a block that must be partially skipped. 776These issues are especially tricky for cases in which upsampling requires 777context rows. In the worst case, jpeg_skip_scanlines() will perform similarly 778to jpeg_read_scanlines() (since it will actually call jpeg_read_scanlines().) 779 7802. Decompressing partial scanlines 781 782 jpeg_crop_scanline (j_decompress_ptr cinfo, JDIMENSION *xoffset, 783 JDIMENSION *width) 784 785This function provides application programmers with the ability to decompress 786only a portion of each row in the JPEG image. It must be called after 787jpeg_start_decompress() and before any calls to jpeg_read_scanlines() or 788jpeg_skip_scanlines(). 789 790If xoffset and width do not form a valid subset of the image row, then this 791function will generate an error. Note that if the output image is scaled, then 792xoffset and width are relative to the scaled image dimensions. 793 794xoffset and width are passed by reference because xoffset must fall on an iMCU 795boundary. If it doesn't, then it will be moved left to the nearest iMCU 796boundary, and width will be increased accordingly. If the calling program does 797not like the adjusted values of xoffset and width, then it can call 798jpeg_crop_scanline() again with new values (for instance, if it wants to move 799xoffset to the nearest iMCU boundary to the right instead of to the left.) 800 801After calling this function, cinfo->output_width will be set to the adjusted 802width. This value should be used when allocating an output buffer to pass to 803jpeg_read_scanlines(). 804 805The output image from a partial-width decompression will be identical to the 806corresponding image region from a full decode, with one exception: The "fancy" 807(smooth) h2v2 (4:2:0) and h2v1 (4:2:2) upsampling algorithms fill in the 808missing chroma components by averaging the chroma components from neighboring 809pixels, except on the right and left edges of the image (where there are no 810neighboring pixels.) When performing a partial-width decompression, these 811"fancy" upsampling algorithms may treat the left and right edges of the partial 812image region as if they are the left and right edges of the image, meaning that 813the upsampling algorithm may be simplified. The result is that the pixels on 814the left or right edge of the partial image may not be exactly identical to the 815corresponding pixels in the original image. 816 817 818Mechanics of usage: include files, linking, etc 819----------------------------------------------- 820 821Applications using the JPEG library should include the header file jpeglib.h 822to obtain declarations of data types and routines. Before including 823jpeglib.h, include system headers that define at least the typedefs FILE and 824size_t. On ANSI-conforming systems, including <stdio.h> is sufficient; on 825older Unix systems, you may need <sys/types.h> to define size_t. 826 827If the application needs to refer to individual JPEG library error codes, also 828include jerror.h to define those symbols. 829 830jpeglib.h indirectly includes the files jconfig.h and jmorecfg.h. If you are 831installing the JPEG header files in a system directory, you will want to 832install all four files: jpeglib.h, jerror.h, jconfig.h, jmorecfg.h. 833 834The most convenient way to include the JPEG code into your executable program 835is to prepare a library file ("libjpeg.a", or a corresponding name on non-Unix 836machines) and reference it at your link step. If you use only half of the 837library (only compression or only decompression), only that much code will be 838included from the library, unless your linker is hopelessly brain-damaged. 839The supplied makefiles build libjpeg.a automatically (see install.txt). 840 841While you can build the JPEG library as a shared library if the whim strikes 842you, we don't really recommend it. The trouble with shared libraries is that 843at some point you'll probably try to substitute a new version of the library 844without recompiling the calling applications. That generally doesn't work 845because the parameter struct declarations usually change with each new 846version. In other words, the library's API is *not* guaranteed binary 847compatible across versions; we only try to ensure source-code compatibility. 848(In hindsight, it might have been smarter to hide the parameter structs from 849applications and introduce a ton of access functions instead. Too late now, 850however.) 851 852It may be worth pointing out that the core JPEG library does not actually 853require the stdio library: only the default source/destination managers and 854error handler need it. You can use the library in a stdio-less environment 855if you replace those modules and use jmemnobs.c (or another memory manager of 856your own devising). More info about the minimum system library requirements 857may be found in jinclude.h. 858 859 860ADVANCED FEATURES 861================= 862 863Compression parameter selection 864------------------------------- 865 866This section describes all the optional parameters you can set for JPEG 867compression, as well as the "helper" routines provided to assist in this 868task. Proper setting of some parameters requires detailed understanding 869of the JPEG standard; if you don't know what a parameter is for, it's best 870not to mess with it! See REFERENCES in the README file for pointers to 871more info about JPEG. 872 873It's a good idea to call jpeg_set_defaults() first, even if you plan to set 874all the parameters; that way your code is more likely to work with future JPEG 875libraries that have additional parameters. For the same reason, we recommend 876you use a helper routine where one is provided, in preference to twiddling 877cinfo fields directly. 878 879The helper routines are: 880 881jpeg_set_defaults (j_compress_ptr cinfo) 882 This routine sets all JPEG parameters to reasonable defaults, using 883 only the input image's color space (field in_color_space, which must 884 already be set in cinfo). Many applications will only need to use 885 this routine and perhaps jpeg_set_quality(). 886 887jpeg_set_colorspace (j_compress_ptr cinfo, J_COLOR_SPACE colorspace) 888 Sets the JPEG file's colorspace (field jpeg_color_space) as specified, 889 and sets other color-space-dependent parameters appropriately. See 890 "Special color spaces", below, before using this. A large number of 891 parameters, including all per-component parameters, are set by this 892 routine; if you want to twiddle individual parameters you should call 893 jpeg_set_colorspace() before rather than after. 894 895jpeg_default_colorspace (j_compress_ptr cinfo) 896 Selects an appropriate JPEG colorspace based on cinfo->in_color_space, 897 and calls jpeg_set_colorspace(). This is actually a subroutine of 898 jpeg_set_defaults(). It's broken out in case you want to change 899 just the colorspace-dependent JPEG parameters. 900 901jpeg_set_quality (j_compress_ptr cinfo, int quality, boolean force_baseline) 902 Constructs JPEG quantization tables appropriate for the indicated 903 quality setting. The quality value is expressed on the 0..100 scale 904 recommended by IJG (cjpeg's "-quality" switch uses this routine). 905 Note that the exact mapping from quality values to tables may change 906 in future IJG releases as more is learned about DCT quantization. 907 If the force_baseline parameter is TRUE, then the quantization table 908 entries are constrained to the range 1..255 for full JPEG baseline 909 compatibility. In the current implementation, this only makes a 910 difference for quality settings below 25, and it effectively prevents 911 very small/low quality files from being generated. The IJG decoder 912 is capable of reading the non-baseline files generated at low quality 913 settings when force_baseline is FALSE, but other decoders may not be. 914 915jpeg_set_linear_quality (j_compress_ptr cinfo, int scale_factor, 916 boolean force_baseline) 917 Same as jpeg_set_quality() except that the generated tables are the 918 sample tables given in the JPEC spec section K.1, multiplied by the 919 specified scale factor (which is expressed as a percentage; thus 920 scale_factor = 100 reproduces the spec's tables). Note that larger 921 scale factors give lower quality. This entry point is useful for 922 conforming to the Adobe PostScript DCT conventions, but we do not 923 recommend linear scaling as a user-visible quality scale otherwise. 924 force_baseline again constrains the computed table entries to 1..255. 925 926int jpeg_quality_scaling (int quality) 927 Converts a value on the IJG-recommended quality scale to a linear 928 scaling percentage. Note that this routine may change or go away 929 in future releases --- IJG may choose to adopt a scaling method that 930 can't be expressed as a simple scalar multiplier, in which case the 931 premise of this routine collapses. Caveat user. 932 933jpeg_default_qtables (j_compress_ptr cinfo, boolean force_baseline) 934 [libjpeg v7+ API/ABI emulation only] 935 Set default quantization tables with linear q_scale_factor[] values 936 (see below). 937 938jpeg_add_quant_table (j_compress_ptr cinfo, int which_tbl, 939 const unsigned int *basic_table, 940 int scale_factor, boolean force_baseline) 941 Allows an arbitrary quantization table to be created. which_tbl 942 indicates which table slot to fill. basic_table points to an array 943 of 64 unsigned ints given in normal array order. These values are 944 multiplied by scale_factor/100 and then clamped to the range 1..65535 945 (or to 1..255 if force_baseline is TRUE). 946 CAUTION: prior to library version 6a, jpeg_add_quant_table expected 947 the basic table to be given in JPEG zigzag order. If you need to 948 write code that works with either older or newer versions of this 949 routine, you must check the library version number. Something like 950 "#if JPEG_LIB_VERSION >= 61" is the right test. 951 952jpeg_simple_progression (j_compress_ptr cinfo) 953 Generates a default scan script for writing a progressive-JPEG file. 954 This is the recommended method of creating a progressive file, 955 unless you want to make a custom scan sequence. You must ensure that 956 the JPEG color space is set correctly before calling this routine. 957 958 959Compression parameters (cinfo fields) include: 960 961J_DCT_METHOD dct_method 962 Selects the algorithm used for the DCT step. Choices are: 963 JDCT_ISLOW: slow but accurate integer algorithm 964 JDCT_IFAST: faster, less accurate integer method 965 JDCT_FLOAT: floating-point method 966 JDCT_DEFAULT: default method (normally JDCT_ISLOW) 967 JDCT_FASTEST: fastest method (normally JDCT_IFAST) 968 In libjpeg-turbo, JDCT_IFAST is generally about 5-15% faster than 969 JDCT_ISLOW when using the x86/x86-64 SIMD extensions (results may vary 970 with other SIMD implementations, or when using libjpeg-turbo without 971 SIMD extensions.) For quality levels of 90 and below, there should be 972 little or no perceptible difference between the two algorithms. For 973 quality levels above 90, however, the difference between JDCT_IFAST and 974 JDCT_ISLOW becomes more pronounced. With quality=97, for instance, 975 JDCT_IFAST incurs generally about a 1-3 dB loss (in PSNR) relative to 976 JDCT_ISLOW, but this can be larger for some images. Do not use 977 JDCT_IFAST with quality levels above 97. The algorithm often 978 degenerates at quality=98 and above and can actually produce a more 979 lossy image than if lower quality levels had been used. Also, in 980 libjpeg-turbo, JDCT_IFAST is not fully accelerated for quality levels 981 above 97, so it will be slower than JDCT_ISLOW. JDCT_FLOAT is mainly a 982 legacy feature. It does not produce significantly more accurate 983 results than the ISLOW method, and it is much slower. The FLOAT method 984 may also give different results on different machines due to varying 985 roundoff behavior, whereas the integer methods should give the same 986 results on all machines. 987 988J_COLOR_SPACE jpeg_color_space 989int num_components 990 The JPEG color space and corresponding number of components; see 991 "Special color spaces", below, for more info. We recommend using 992 jpeg_set_color_space() if you want to change these. 993 994boolean optimize_coding 995 TRUE causes the compressor to compute optimal Huffman coding tables 996 for the image. This requires an extra pass over the data and 997 therefore costs a good deal of space and time. The default is 998 FALSE, which tells the compressor to use the supplied or default 999 Huffman tables. In most cases optimal tables save only a few percent 1000 of file size compared to the default tables. Note that when this is 1001 TRUE, you need not supply Huffman tables at all, and any you do 1002 supply will be overwritten. 1003 1004unsigned int restart_interval 1005int restart_in_rows 1006 To emit restart markers in the JPEG file, set one of these nonzero. 1007 Set restart_interval to specify the exact interval in MCU blocks. 1008 Set restart_in_rows to specify the interval in MCU rows. (If 1009 restart_in_rows is not 0, then restart_interval is set after the 1010 image width in MCUs is computed.) Defaults are zero (no restarts). 1011 One restart marker per MCU row is often a good choice. 1012 NOTE: the overhead of restart markers is higher in grayscale JPEG 1013 files than in color files, and MUCH higher in progressive JPEGs. 1014 If you use restarts, you may want to use larger intervals in those 1015 cases. 1016 1017const jpeg_scan_info * scan_info 1018int num_scans 1019 By default, scan_info is NULL; this causes the compressor to write a 1020 single-scan sequential JPEG file. If not NULL, scan_info points to 1021 an array of scan definition records of length num_scans. The 1022 compressor will then write a JPEG file having one scan for each scan 1023 definition record. This is used to generate noninterleaved or 1024 progressive JPEG files. The library checks that the scan array 1025 defines a valid JPEG scan sequence. (jpeg_simple_progression creates 1026 a suitable scan definition array for progressive JPEG.) This is 1027 discussed further under "Progressive JPEG support". 1028 1029int smoothing_factor 1030 If non-zero, the input image is smoothed; the value should be 1 for 1031 minimal smoothing to 100 for maximum smoothing. Consult jcsample.c 1032 for details of the smoothing algorithm. The default is zero. 1033 1034boolean write_JFIF_header 1035 If TRUE, a JFIF APP0 marker is emitted. jpeg_set_defaults() and 1036 jpeg_set_colorspace() set this TRUE if a JFIF-legal JPEG color space 1037 (ie, YCbCr or grayscale) is selected, otherwise FALSE. 1038 1039UINT8 JFIF_major_version 1040UINT8 JFIF_minor_version 1041 The version number to be written into the JFIF marker. 1042 jpeg_set_defaults() initializes the version to 1.01 (major=minor=1). 1043 You should set it to 1.02 (major=1, minor=2) if you plan to write 1044 any JFIF 1.02 extension markers. 1045 1046UINT8 density_unit 1047UINT16 X_density 1048UINT16 Y_density 1049 The resolution information to be written into the JFIF marker; 1050 not used otherwise. density_unit may be 0 for unknown, 1051 1 for dots/inch, or 2 for dots/cm. The default values are 0,1,1 1052 indicating square pixels of unknown size. 1053 1054boolean write_Adobe_marker 1055 If TRUE, an Adobe APP14 marker is emitted. jpeg_set_defaults() and 1056 jpeg_set_colorspace() set this TRUE if JPEG color space RGB, CMYK, 1057 or YCCK is selected, otherwise FALSE. It is generally a bad idea 1058 to set both write_JFIF_header and write_Adobe_marker. In fact, 1059 you probably shouldn't change the default settings at all --- the 1060 default behavior ensures that the JPEG file's color space can be 1061 recognized by the decoder. 1062 1063JQUANT_TBL * quant_tbl_ptrs[NUM_QUANT_TBLS] 1064 Pointers to coefficient quantization tables, one per table slot, 1065 or NULL if no table is defined for a slot. Usually these should 1066 be set via one of the above helper routines; jpeg_add_quant_table() 1067 is general enough to define any quantization table. The other 1068 routines will set up table slot 0 for luminance quality and table 1069 slot 1 for chrominance. 1070 1071int q_scale_factor[NUM_QUANT_TBLS] 1072 [libjpeg v7+ API/ABI emulation only] 1073 Linear quantization scaling factors (0-100, default 100) 1074 for use with jpeg_default_qtables(). 1075 See rdswitch.c and cjpeg.c for an example of usage. 1076 Note that the q_scale_factor[] values use "linear" scales, so JPEG 1077 quality levels chosen by the user must be converted to these scales 1078 using jpeg_quality_scaling(). Here is an example that corresponds to 1079 cjpeg -quality 90,70: 1080 1081 jpeg_set_defaults(cinfo); 1082 1083 /* Set luminance quality 90. */ 1084 cinfo->q_scale_factor[0] = jpeg_quality_scaling(90); 1085 /* Set chrominance quality 70. */ 1086 cinfo->q_scale_factor[1] = jpeg_quality_scaling(70); 1087 1088 jpeg_default_qtables(cinfo, force_baseline); 1089 1090 CAUTION: Setting separate quality levels for chrominance and luminance 1091 is mainly only useful if chrominance subsampling is disabled. 2x2 1092 chrominance subsampling (AKA "4:2:0") is the default, but you can 1093 explicitly disable subsampling as follows: 1094 1095 cinfo->comp_info[0].v_samp_factor = 1; 1096 cinfo->comp_info[0].h_samp_factor = 1; 1097 1098JHUFF_TBL * dc_huff_tbl_ptrs[NUM_HUFF_TBLS] 1099JHUFF_TBL * ac_huff_tbl_ptrs[NUM_HUFF_TBLS] 1100 Pointers to Huffman coding tables, one per table slot, or NULL if 1101 no table is defined for a slot. Slots 0 and 1 are filled with the 1102 JPEG sample tables by jpeg_set_defaults(). If you need to allocate 1103 more table structures, jpeg_alloc_huff_table() may be used. 1104 Note that optimal Huffman tables can be computed for an image 1105 by setting optimize_coding, as discussed above; there's seldom 1106 any need to mess with providing your own Huffman tables. 1107 1108 1109[libjpeg v7+ API/ABI emulation only] 1110The actual dimensions of the JPEG image that will be written to the file are 1111given by the following fields. These are computed from the input image 1112dimensions and the compression parameters by jpeg_start_compress(). You can 1113also call jpeg_calc_jpeg_dimensions() to obtain the values that will result 1114from the current parameter settings. This can be useful if you are trying 1115to pick a scaling ratio that will get close to a desired target size. 1116 1117JDIMENSION jpeg_width Actual dimensions of output image. 1118JDIMENSION jpeg_height 1119 1120 1121Per-component parameters are stored in the struct cinfo.comp_info[i] for 1122component number i. Note that components here refer to components of the 1123JPEG color space, *not* the source image color space. A suitably large 1124comp_info[] array is allocated by jpeg_set_defaults(); if you choose not 1125to use that routine, it's up to you to allocate the array. 1126 1127int component_id 1128 The one-byte identifier code to be recorded in the JPEG file for 1129 this component. For the standard color spaces, we recommend you 1130 leave the default values alone. 1131 1132int h_samp_factor 1133int v_samp_factor 1134 Horizontal and vertical sampling factors for the component; must 1135 be 1..4 according to the JPEG standard. Note that larger sampling 1136 factors indicate a higher-resolution component; many people find 1137 this behavior quite unintuitive. The default values are 2,2 for 1138 luminance components and 1,1 for chrominance components, except 1139 for grayscale where 1,1 is used. 1140 1141int quant_tbl_no 1142 Quantization table number for component. The default value is 1143 0 for luminance components and 1 for chrominance components. 1144 1145int dc_tbl_no 1146int ac_tbl_no 1147 DC and AC entropy coding table numbers. The default values are 1148 0 for luminance components and 1 for chrominance components. 1149 1150int component_index 1151 Must equal the component's index in comp_info[]. (Beginning in 1152 release v6, the compressor library will fill this in automatically; 1153 you don't have to.) 1154 1155 1156Decompression parameter selection 1157--------------------------------- 1158 1159Decompression parameter selection is somewhat simpler than compression 1160parameter selection, since all of the JPEG internal parameters are 1161recorded in the source file and need not be supplied by the application. 1162(Unless you are working with abbreviated files, in which case see 1163"Abbreviated datastreams", below.) Decompression parameters control 1164the postprocessing done on the image to deliver it in a format suitable 1165for the application's use. Many of the parameters control speed/quality 1166tradeoffs, in which faster decompression may be obtained at the price of 1167a poorer-quality image. The defaults select the highest quality (slowest) 1168processing. 1169 1170The following fields in the JPEG object are set by jpeg_read_header() and 1171may be useful to the application in choosing decompression parameters: 1172 1173JDIMENSION image_width Width and height of image 1174JDIMENSION image_height 1175int num_components Number of color components 1176J_COLOR_SPACE jpeg_color_space Colorspace of image 1177boolean saw_JFIF_marker TRUE if a JFIF APP0 marker was seen 1178 UINT8 JFIF_major_version Version information from JFIF marker 1179 UINT8 JFIF_minor_version 1180 UINT8 density_unit Resolution data from JFIF marker 1181 UINT16 X_density 1182 UINT16 Y_density 1183boolean saw_Adobe_marker TRUE if an Adobe APP14 marker was seen 1184 UINT8 Adobe_transform Color transform code from Adobe marker 1185 1186The JPEG color space, unfortunately, is something of a guess since the JPEG 1187standard proper does not provide a way to record it. In practice most files 1188adhere to the JFIF or Adobe conventions, and the decoder will recognize these 1189correctly. See "Special color spaces", below, for more info. 1190 1191 1192The decompression parameters that determine the basic properties of the 1193returned image are: 1194 1195J_COLOR_SPACE out_color_space 1196 Output color space. jpeg_read_header() sets an appropriate default 1197 based on jpeg_color_space; typically it will be RGB or grayscale. 1198 The application can change this field to request output in a different 1199 colorspace. For example, set it to JCS_GRAYSCALE to get grayscale 1200 output from a color file. (This is useful for previewing: grayscale 1201 output is faster than full color since the color components need not 1202 be processed.) Note that not all possible color space transforms are 1203 currently implemented; you may need to extend jdcolor.c if you want an 1204 unusual conversion. 1205 1206unsigned int scale_num, scale_denom 1207 Scale the image by the fraction scale_num/scale_denom. Default is 1208 1/1, or no scaling. Currently, the only supported scaling ratios 1209 are M/8 with all M from 1 to 16, or any reduced fraction thereof (such 1210 as 1/2, 3/4, etc.) (The library design allows for arbitrary 1211 scaling ratios but this is not likely to be implemented any time soon.) 1212 Smaller scaling ratios permit significantly faster decoding since 1213 fewer pixels need be processed and a simpler IDCT method can be used. 1214 1215boolean quantize_colors 1216 If set TRUE, colormapped output will be delivered. Default is FALSE, 1217 meaning that full-color output will be delivered. 1218 1219The next three parameters are relevant only if quantize_colors is TRUE. 1220 1221int desired_number_of_colors 1222 Maximum number of colors to use in generating a library-supplied color 1223 map (the actual number of colors is returned in a different field). 1224 Default 256. Ignored when the application supplies its own color map. 1225 1226boolean two_pass_quantize 1227 If TRUE, an extra pass over the image is made to select a custom color 1228 map for the image. This usually looks a lot better than the one-size- 1229 fits-all colormap that is used otherwise. Default is TRUE. Ignored 1230 when the application supplies its own color map. 1231 1232J_DITHER_MODE dither_mode 1233 Selects color dithering method. Supported values are: 1234 JDITHER_NONE no dithering: fast, very low quality 1235 JDITHER_ORDERED ordered dither: moderate speed and quality 1236 JDITHER_FS Floyd-Steinberg dither: slow, high quality 1237 Default is JDITHER_FS. (At present, ordered dither is implemented 1238 only in the single-pass, standard-colormap case. If you ask for 1239 ordered dither when two_pass_quantize is TRUE or when you supply 1240 an external color map, you'll get F-S dithering.) 1241 1242When quantize_colors is TRUE, the target color map is described by the next 1243two fields. colormap is set to NULL by jpeg_read_header(). The application 1244can supply a color map by setting colormap non-NULL and setting 1245actual_number_of_colors to the map size. Otherwise, jpeg_start_decompress() 1246selects a suitable color map and sets these two fields itself. 1247[Implementation restriction: at present, an externally supplied colormap is 1248only accepted for 3-component output color spaces.] 1249 1250JSAMPARRAY colormap 1251 The color map, represented as a 2-D pixel array of out_color_components 1252 rows and actual_number_of_colors columns. Ignored if not quantizing. 1253 CAUTION: if the JPEG library creates its own colormap, the storage 1254 pointed to by this field is released by jpeg_finish_decompress(). 1255 Copy the colormap somewhere else first, if you want to save it. 1256 1257int actual_number_of_colors 1258 The number of colors in the color map. 1259 1260Additional decompression parameters that the application may set include: 1261 1262J_DCT_METHOD dct_method 1263 Selects the algorithm used for the DCT step. Choices are: 1264 JDCT_ISLOW: slow but accurate integer algorithm 1265 JDCT_IFAST: faster, less accurate integer method 1266 JDCT_FLOAT: floating-point method 1267 JDCT_DEFAULT: default method (normally JDCT_ISLOW) 1268 JDCT_FASTEST: fastest method (normally JDCT_IFAST) 1269 In libjpeg-turbo, JDCT_IFAST is generally about 5-15% faster than 1270 JDCT_ISLOW when using the x86/x86-64 SIMD extensions (results may vary 1271 with other SIMD implementations, or when using libjpeg-turbo without 1272 SIMD extensions.) If the JPEG image was compressed using a quality 1273 level of 85 or below, then there should be little or no perceptible 1274 difference between the two algorithms. When decompressing images that 1275 were compressed using quality levels above 85, however, the difference 1276 between JDCT_IFAST and JDCT_ISLOW becomes more pronounced. With images 1277 compressed using quality=97, for instance, JDCT_IFAST incurs generally 1278 about a 4-6 dB loss (in PSNR) relative to JDCT_ISLOW, but this can be 1279 larger for some images. If you can avoid it, do not use JDCT_IFAST 1280 when decompressing images that were compressed using quality levels 1281 above 97. The algorithm often degenerates for such images and can 1282 actually produce a more lossy output image than if the JPEG image had 1283 been compressed using lower quality levels. JDCT_FLOAT is mainly a 1284 legacy feature. It does not produce significantly more accurate 1285 results than the ISLOW method, and it is much slower. The FLOAT method 1286 may also give different results on different machines due to varying 1287 roundoff behavior, whereas the integer methods should give the same 1288 results on all machines. 1289 1290boolean do_fancy_upsampling 1291 If TRUE, do careful upsampling of chroma components. If FALSE, 1292 a faster but sloppier method is used. Default is TRUE. The visual 1293 impact of the sloppier method is often very small. 1294 1295boolean do_block_smoothing 1296 If TRUE, interblock smoothing is applied in early stages of decoding 1297 progressive JPEG files; if FALSE, not. Default is TRUE. Early 1298 progression stages look "fuzzy" with smoothing, "blocky" without. 1299 In any case, block smoothing ceases to be applied after the first few 1300 AC coefficients are known to full accuracy, so it is relevant only 1301 when using buffered-image mode for progressive images. 1302 1303boolean enable_1pass_quant 1304boolean enable_external_quant 1305boolean enable_2pass_quant 1306 These are significant only in buffered-image mode, which is 1307 described in its own section below. 1308 1309 1310The output image dimensions are given by the following fields. These are 1311computed from the source image dimensions and the decompression parameters 1312by jpeg_start_decompress(). You can also call jpeg_calc_output_dimensions() 1313to obtain the values that will result from the current parameter settings. 1314This can be useful if you are trying to pick a scaling ratio that will get 1315close to a desired target size. It's also important if you are using the 1316JPEG library's memory manager to allocate output buffer space, because you 1317are supposed to request such buffers *before* jpeg_start_decompress(). 1318 1319JDIMENSION output_width Actual dimensions of output image. 1320JDIMENSION output_height 1321int out_color_components Number of color components in out_color_space. 1322int output_components Number of color components returned. 1323int rec_outbuf_height Recommended height of scanline buffer. 1324 1325When quantizing colors, output_components is 1, indicating a single color map 1326index per pixel. Otherwise it equals out_color_components. The output arrays 1327are required to be output_width * output_components JSAMPLEs wide. 1328 1329rec_outbuf_height is the recommended minimum height (in scanlines) of the 1330buffer passed to jpeg_read_scanlines(). If the buffer is smaller, the 1331library will still work, but time will be wasted due to unnecessary data 1332copying. In high-quality modes, rec_outbuf_height is always 1, but some 1333faster, lower-quality modes set it to larger values (typically 2 to 4). 1334If you are going to ask for a high-speed processing mode, you may as well 1335go to the trouble of honoring rec_outbuf_height so as to avoid data copying. 1336(An output buffer larger than rec_outbuf_height lines is OK, but won't 1337provide any material speed improvement over that height.) 1338 1339 1340Special color spaces 1341-------------------- 1342 1343The JPEG standard itself is "color blind" and doesn't specify any particular 1344color space. It is customary to convert color data to a luminance/chrominance 1345color space before compressing, since this permits greater compression. The 1346existing de-facto JPEG file format standards specify YCbCr or grayscale data 1347(JFIF), or grayscale, RGB, YCbCr, CMYK, or YCCK (Adobe). For special 1348applications such as multispectral images, other color spaces can be used, 1349but it must be understood that such files will be unportable. 1350 1351The JPEG library can handle the most common colorspace conversions (namely 1352RGB <=> YCbCr and CMYK <=> YCCK). It can also deal with data of an unknown 1353color space, passing it through without conversion. If you deal extensively 1354with an unusual color space, you can easily extend the library to understand 1355additional color spaces and perform appropriate conversions. 1356 1357For compression, the source data's color space is specified by field 1358in_color_space. This is transformed to the JPEG file's color space given 1359by jpeg_color_space. jpeg_set_defaults() chooses a reasonable JPEG color 1360space depending on in_color_space, but you can override this by calling 1361jpeg_set_colorspace(). Of course you must select a supported transformation. 1362jccolor.c currently supports the following transformations: 1363 RGB => YCbCr 1364 RGB => GRAYSCALE 1365 YCbCr => GRAYSCALE 1366 CMYK => YCCK 1367plus the null transforms: GRAYSCALE => GRAYSCALE, RGB => RGB, 1368YCbCr => YCbCr, CMYK => CMYK, YCCK => YCCK, and UNKNOWN => UNKNOWN. 1369 1370The de-facto file format standards (JFIF and Adobe) specify APPn markers that 1371indicate the color space of the JPEG file. It is important to ensure that 1372these are written correctly, or omitted if the JPEG file's color space is not 1373one of the ones supported by the de-facto standards. jpeg_set_colorspace() 1374will set the compression parameters to include or omit the APPn markers 1375properly, so long as it is told the truth about the JPEG color space. 1376For example, if you are writing some random 3-component color space without 1377conversion, don't try to fake out the library by setting in_color_space and 1378jpeg_color_space to JCS_YCbCr; use JCS_UNKNOWN. You may want to write an 1379APPn marker of your own devising to identify the colorspace --- see "Special 1380markers", below. 1381 1382When told that the color space is UNKNOWN, the library will default to using 1383luminance-quality compression parameters for all color components. You may 1384well want to change these parameters. See the source code for 1385jpeg_set_colorspace(), in jcparam.c, for details. 1386 1387For decompression, the JPEG file's color space is given in jpeg_color_space, 1388and this is transformed to the output color space out_color_space. 1389jpeg_read_header's setting of jpeg_color_space can be relied on if the file 1390conforms to JFIF or Adobe conventions, but otherwise it is no better than a 1391guess. If you know the JPEG file's color space for certain, you can override 1392jpeg_read_header's guess by setting jpeg_color_space. jpeg_read_header also 1393selects a default output color space based on (its guess of) jpeg_color_space; 1394set out_color_space to override this. Again, you must select a supported 1395transformation. jdcolor.c currently supports 1396 YCbCr => RGB 1397 YCbCr => GRAYSCALE 1398 RGB => GRAYSCALE 1399 GRAYSCALE => RGB 1400 YCCK => CMYK 1401as well as the null transforms. (Since GRAYSCALE=>RGB is provided, an 1402application can force grayscale JPEGs to look like color JPEGs if it only 1403wants to handle one case.) 1404 1405The two-pass color quantizer, jquant2.c, is specialized to handle RGB data 1406(it weights distances appropriately for RGB colors). You'll need to modify 1407the code if you want to use it for non-RGB output color spaces. Note that 1408jquant2.c is used to map to an application-supplied colormap as well as for 1409the normal two-pass colormap selection process. 1410 1411CAUTION: it appears that Adobe Photoshop writes inverted data in CMYK JPEG 1412files: 0 represents 100% ink coverage, rather than 0% ink as you'd expect. 1413This is arguably a bug in Photoshop, but if you need to work with Photoshop 1414CMYK files, you will have to deal with it in your application. We cannot 1415"fix" this in the library by inverting the data during the CMYK<=>YCCK 1416transform, because that would break other applications, notably Ghostscript. 1417Photoshop versions prior to 3.0 write EPS files containing JPEG-encoded CMYK 1418data in the same inverted-YCCK representation used in bare JPEG files, but 1419the surrounding PostScript code performs an inversion using the PS image 1420operator. I am told that Photoshop 3.0 will write uninverted YCCK in 1421EPS/JPEG files, and will omit the PS-level inversion. (But the data 1422polarity used in bare JPEG files will not change in 3.0.) In either case, 1423the JPEG library must not invert the data itself, or else Ghostscript would 1424read these EPS files incorrectly. 1425 1426 1427Error handling 1428-------------- 1429 1430When the default error handler is used, any error detected inside the JPEG 1431routines will cause a message to be printed on stderr, followed by exit(). 1432You can supply your own error handling routines to override this behavior 1433and to control the treatment of nonfatal warnings and trace/debug messages. 1434The file example.c illustrates the most common case, which is to have the 1435application regain control after an error rather than exiting. 1436 1437The JPEG library never writes any message directly; it always goes through 1438the error handling routines. Three classes of messages are recognized: 1439 * Fatal errors: the library cannot continue. 1440 * Warnings: the library can continue, but the data is corrupt, and a 1441 damaged output image is likely to result. 1442 * Trace/informational messages. These come with a trace level indicating 1443 the importance of the message; you can control the verbosity of the 1444 program by adjusting the maximum trace level that will be displayed. 1445 1446You may, if you wish, simply replace the entire JPEG error handling module 1447(jerror.c) with your own code. However, you can avoid code duplication by 1448only replacing some of the routines depending on the behavior you need. 1449This is accomplished by calling jpeg_std_error() as usual, but then overriding 1450some of the method pointers in the jpeg_error_mgr struct, as illustrated by 1451example.c. 1452 1453All of the error handling routines will receive a pointer to the JPEG object 1454(a j_common_ptr which points to either a jpeg_compress_struct or a 1455jpeg_decompress_struct; if you need to tell which, test the is_decompressor 1456field). This struct includes a pointer to the error manager struct in its 1457"err" field. Frequently, custom error handler routines will need to access 1458additional data which is not known to the JPEG library or the standard error 1459handler. The most convenient way to do this is to embed either the JPEG 1460object or the jpeg_error_mgr struct in a larger structure that contains 1461additional fields; then casting the passed pointer provides access to the 1462additional fields. Again, see example.c for one way to do it. (Beginning 1463with IJG version 6b, there is also a void pointer "client_data" in each 1464JPEG object, which the application can also use to find related data. 1465The library does not touch client_data at all.) 1466 1467The individual methods that you might wish to override are: 1468 1469error_exit (j_common_ptr cinfo) 1470 Receives control for a fatal error. Information sufficient to 1471 generate the error message has been stored in cinfo->err; call 1472 output_message to display it. Control must NOT return to the caller; 1473 generally this routine will exit() or longjmp() somewhere. 1474 Typically you would override this routine to get rid of the exit() 1475 default behavior. Note that if you continue processing, you should 1476 clean up the JPEG object with jpeg_abort() or jpeg_destroy(). 1477 1478output_message (j_common_ptr cinfo) 1479 Actual output of any JPEG message. Override this to send messages 1480 somewhere other than stderr. Note that this method does not know 1481 how to generate a message, only where to send it. 1482 1483format_message (j_common_ptr cinfo, char * buffer) 1484 Constructs a readable error message string based on the error info 1485 stored in cinfo->err. This method is called by output_message. Few 1486 applications should need to override this method. One possible 1487 reason for doing so is to implement dynamic switching of error message 1488 language. 1489 1490emit_message (j_common_ptr cinfo, int msg_level) 1491 Decide whether or not to emit a warning or trace message; if so, 1492 calls output_message. The main reason for overriding this method 1493 would be to abort on warnings. msg_level is -1 for warnings, 1494 0 and up for trace messages. 1495 1496Only error_exit() and emit_message() are called from the rest of the JPEG 1497library; the other two are internal to the error handler. 1498 1499The actual message texts are stored in an array of strings which is pointed to 1500by the field err->jpeg_message_table. The messages are numbered from 0 to 1501err->last_jpeg_message, and it is these code numbers that are used in the 1502JPEG library code. You could replace the message texts (for instance, with 1503messages in French or German) by changing the message table pointer. See 1504jerror.h for the default texts. CAUTION: this table will almost certainly 1505change or grow from one library version to the next. 1506 1507It may be useful for an application to add its own message texts that are 1508handled by the same mechanism. The error handler supports a second "add-on" 1509message table for this purpose. To define an addon table, set the pointer 1510err->addon_message_table and the message numbers err->first_addon_message and 1511err->last_addon_message. If you number the addon messages beginning at 1000 1512or so, you won't have to worry about conflicts with the library's built-in 1513messages. See the sample applications cjpeg/djpeg for an example of using 1514addon messages (the addon messages are defined in cderror.h). 1515 1516Actual invocation of the error handler is done via macros defined in jerror.h: 1517 ERREXITn(...) for fatal errors 1518 WARNMSn(...) for corrupt-data warnings 1519 TRACEMSn(...) for trace and informational messages. 1520These macros store the message code and any additional parameters into the 1521error handler struct, then invoke the error_exit() or emit_message() method. 1522The variants of each macro are for varying numbers of additional parameters. 1523The additional parameters are inserted into the generated message using 1524standard printf() format codes. 1525 1526See jerror.h and jerror.c for further details. 1527 1528 1529Compressed data handling (source and destination managers) 1530---------------------------------------------------------- 1531 1532The JPEG compression library sends its compressed data to a "destination 1533manager" module. The default destination manager just writes the data to a 1534memory buffer or to a stdio stream, but you can provide your own manager to 1535do something else. Similarly, the decompression library calls a "source 1536manager" to obtain the compressed data; you can provide your own source 1537manager if you want the data to come from somewhere other than a memory 1538buffer or a stdio stream. 1539 1540In both cases, compressed data is processed a bufferload at a time: the 1541destination or source manager provides a work buffer, and the library invokes 1542the manager only when the buffer is filled or emptied. (You could define a 1543one-character buffer to force the manager to be invoked for each byte, but 1544that would be rather inefficient.) The buffer's size and location are 1545controlled by the manager, not by the library. For example, the memory 1546source manager just makes the buffer pointer and length point to the original 1547data in memory. In this case the buffer-reload procedure will be invoked 1548only if the decompressor ran off the end of the datastream, which would 1549indicate an erroneous datastream. 1550 1551The work buffer is defined as an array of datatype JOCTET, which is generally 1552"char" or "unsigned char". On a machine where char is not exactly 8 bits 1553wide, you must define JOCTET as a wider data type and then modify the data 1554source and destination modules to transcribe the work arrays into 8-bit units 1555on external storage. 1556 1557A data destination manager struct contains a pointer and count defining the 1558next byte to write in the work buffer and the remaining free space: 1559 1560 JOCTET * next_output_byte; /* => next byte to write in buffer */ 1561 size_t free_in_buffer; /* # of byte spaces remaining in buffer */ 1562 1563The library increments the pointer and decrements the count until the buffer 1564is filled. The manager's empty_output_buffer method must reset the pointer 1565and count. The manager is expected to remember the buffer's starting address 1566and total size in private fields not visible to the library. 1567 1568A data destination manager provides three methods: 1569 1570init_destination (j_compress_ptr cinfo) 1571 Initialize destination. This is called by jpeg_start_compress() 1572 before any data is actually written. It must initialize 1573 next_output_byte and free_in_buffer. free_in_buffer must be 1574 initialized to a positive value. 1575 1576empty_output_buffer (j_compress_ptr cinfo) 1577 This is called whenever the buffer has filled (free_in_buffer 1578 reaches zero). In typical applications, it should write out the 1579 *entire* buffer (use the saved start address and buffer length; 1580 ignore the current state of next_output_byte and free_in_buffer). 1581 Then reset the pointer & count to the start of the buffer, and 1582 return TRUE indicating that the buffer has been dumped. 1583 free_in_buffer must be set to a positive value when TRUE is 1584 returned. A FALSE return should only be used when I/O suspension is 1585 desired (this operating mode is discussed in the next section). 1586 1587term_destination (j_compress_ptr cinfo) 1588 Terminate destination --- called by jpeg_finish_compress() after all 1589 data has been written. In most applications, this must flush any 1590 data remaining in the buffer. Use either next_output_byte or 1591 free_in_buffer to determine how much data is in the buffer. 1592 1593term_destination() is NOT called by jpeg_abort() or jpeg_destroy(). If you 1594want the destination manager to be cleaned up during an abort, you must do it 1595yourself. 1596 1597You will also need code to create a jpeg_destination_mgr struct, fill in its 1598method pointers, and insert a pointer to the struct into the "dest" field of 1599the JPEG compression object. This can be done in-line in your setup code if 1600you like, but it's probably cleaner to provide a separate routine similar to 1601the jpeg_stdio_dest() or jpeg_mem_dest() routines of the supplied destination 1602managers. 1603 1604Decompression source managers follow a parallel design, but with some 1605additional frammishes. The source manager struct contains a pointer and count 1606defining the next byte to read from the work buffer and the number of bytes 1607remaining: 1608 1609 const JOCTET * next_input_byte; /* => next byte to read from buffer */ 1610 size_t bytes_in_buffer; /* # of bytes remaining in buffer */ 1611 1612The library increments the pointer and decrements the count until the buffer 1613is emptied. The manager's fill_input_buffer method must reset the pointer and 1614count. In most applications, the manager must remember the buffer's starting 1615address and total size in private fields not visible to the library. 1616 1617A data source manager provides five methods: 1618 1619init_source (j_decompress_ptr cinfo) 1620 Initialize source. This is called by jpeg_read_header() before any 1621 data is actually read. Unlike init_destination(), it may leave 1622 bytes_in_buffer set to 0 (in which case a fill_input_buffer() call 1623 will occur immediately). 1624 1625fill_input_buffer (j_decompress_ptr cinfo) 1626 This is called whenever bytes_in_buffer has reached zero and more 1627 data is wanted. In typical applications, it should read fresh data 1628 into the buffer (ignoring the current state of next_input_byte and 1629 bytes_in_buffer), reset the pointer & count to the start of the 1630 buffer, and return TRUE indicating that the buffer has been reloaded. 1631 It is not necessary to fill the buffer entirely, only to obtain at 1632 least one more byte. bytes_in_buffer MUST be set to a positive value 1633 if TRUE is returned. A FALSE return should only be used when I/O 1634 suspension is desired (this mode is discussed in the next section). 1635 1636skip_input_data (j_decompress_ptr cinfo, long num_bytes) 1637 Skip num_bytes worth of data. The buffer pointer and count should 1638 be advanced over num_bytes input bytes, refilling the buffer as 1639 needed. This is used to skip over a potentially large amount of 1640 uninteresting data (such as an APPn marker). In some applications 1641 it may be possible to optimize away the reading of the skipped data, 1642 but it's not clear that being smart is worth much trouble; large 1643 skips are uncommon. bytes_in_buffer may be zero on return. 1644 A zero or negative skip count should be treated as a no-op. 1645 1646resync_to_restart (j_decompress_ptr cinfo, int desired) 1647 This routine is called only when the decompressor has failed to find 1648 a restart (RSTn) marker where one is expected. Its mission is to 1649 find a suitable point for resuming decompression. For most 1650 applications, we recommend that you just use the default resync 1651 procedure, jpeg_resync_to_restart(). However, if you are able to back 1652 up in the input data stream, or if you have a-priori knowledge about 1653 the likely location of restart markers, you may be able to do better. 1654 Read the read_restart_marker() and jpeg_resync_to_restart() routines 1655 in jdmarker.c if you think you'd like to implement your own resync 1656 procedure. 1657 1658term_source (j_decompress_ptr cinfo) 1659 Terminate source --- called by jpeg_finish_decompress() after all 1660 data has been read. Often a no-op. 1661 1662For both fill_input_buffer() and skip_input_data(), there is no such thing 1663as an EOF return. If the end of the file has been reached, the routine has 1664a choice of exiting via ERREXIT() or inserting fake data into the buffer. 1665In most cases, generating a warning message and inserting a fake EOI marker 1666is the best course of action --- this will allow the decompressor to output 1667however much of the image is there. In pathological cases, the decompressor 1668may swallow the EOI and again demand data ... just keep feeding it fake EOIs. 1669jdatasrc.c illustrates the recommended error recovery behavior. 1670 1671term_source() is NOT called by jpeg_abort() or jpeg_destroy(). If you want 1672the source manager to be cleaned up during an abort, you must do it yourself. 1673 1674You will also need code to create a jpeg_source_mgr struct, fill in its method 1675pointers, and insert a pointer to the struct into the "src" field of the JPEG 1676decompression object. This can be done in-line in your setup code if you 1677like, but it's probably cleaner to provide a separate routine similar to the 1678jpeg_stdio_src() or jpeg_mem_src() routines of the supplied source managers. 1679 1680For more information, consult the memory and stdio source and destination 1681managers in jdatasrc.c and jdatadst.c. 1682 1683 1684I/O suspension 1685-------------- 1686 1687Some applications need to use the JPEG library as an incremental memory-to- 1688memory filter: when the compressed data buffer is filled or emptied, they want 1689control to return to the outer loop, rather than expecting that the buffer can 1690be emptied or reloaded within the data source/destination manager subroutine. 1691The library supports this need by providing an "I/O suspension" mode, which we 1692describe in this section. 1693 1694The I/O suspension mode is not a panacea: nothing is guaranteed about the 1695maximum amount of time spent in any one call to the library, so it will not 1696eliminate response-time problems in single-threaded applications. If you 1697need guaranteed response time, we suggest you "bite the bullet" and implement 1698a real multi-tasking capability. 1699 1700To use I/O suspension, cooperation is needed between the calling application 1701and the data source or destination manager; you will always need a custom 1702source/destination manager. (Please read the previous section if you haven't 1703already.) The basic idea is that the empty_output_buffer() or 1704fill_input_buffer() routine is a no-op, merely returning FALSE to indicate 1705that it has done nothing. Upon seeing this, the JPEG library suspends 1706operation and returns to its caller. The surrounding application is 1707responsible for emptying or refilling the work buffer before calling the 1708JPEG library again. 1709 1710Compression suspension: 1711 1712For compression suspension, use an empty_output_buffer() routine that returns 1713FALSE; typically it will not do anything else. This will cause the 1714compressor to return to the caller of jpeg_write_scanlines(), with the return 1715value indicating that not all the supplied scanlines have been accepted. 1716The application must make more room in the output buffer, adjust the output 1717buffer pointer/count appropriately, and then call jpeg_write_scanlines() 1718again, pointing to the first unconsumed scanline. 1719 1720When forced to suspend, the compressor will backtrack to a convenient stopping 1721point (usually the start of the current MCU); it will regenerate some output 1722data when restarted. Therefore, although empty_output_buffer() is only 1723called when the buffer is filled, you should NOT write out the entire buffer 1724after a suspension. Write only the data up to the current position of 1725next_output_byte/free_in_buffer. The data beyond that point will be 1726regenerated after resumption. 1727 1728Because of the backtracking behavior, a good-size output buffer is essential 1729for efficiency; you don't want the compressor to suspend often. (In fact, an 1730overly small buffer could lead to infinite looping, if a single MCU required 1731more data than would fit in the buffer.) We recommend a buffer of at least 1732several Kbytes. You may want to insert explicit code to ensure that you don't 1733call jpeg_write_scanlines() unless there is a reasonable amount of space in 1734the output buffer; in other words, flush the buffer before trying to compress 1735more data. 1736 1737The compressor does not allow suspension while it is trying to write JPEG 1738markers at the beginning and end of the file. This means that: 1739 * At the beginning of a compression operation, there must be enough free 1740 space in the output buffer to hold the header markers (typically 600 or 1741 so bytes). The recommended buffer size is bigger than this anyway, so 1742 this is not a problem as long as you start with an empty buffer. However, 1743 this restriction might catch you if you insert large special markers, such 1744 as a JFIF thumbnail image, without flushing the buffer afterwards. 1745 * When you call jpeg_finish_compress(), there must be enough space in the 1746 output buffer to emit any buffered data and the final EOI marker. In the 1747 current implementation, half a dozen bytes should suffice for this, but 1748 for safety's sake we recommend ensuring that at least 100 bytes are free 1749 before calling jpeg_finish_compress(). 1750 1751A more significant restriction is that jpeg_finish_compress() cannot suspend. 1752This means you cannot use suspension with multi-pass operating modes, namely 1753Huffman code optimization and multiple-scan output. Those modes write the 1754whole file during jpeg_finish_compress(), which will certainly result in 1755buffer overrun. (Note that this restriction applies only to compression, 1756not decompression. The decompressor supports input suspension in all of its 1757operating modes.) 1758 1759Decompression suspension: 1760 1761For decompression suspension, use a fill_input_buffer() routine that simply 1762returns FALSE (except perhaps during error recovery, as discussed below). 1763This will cause the decompressor to return to its caller with an indication 1764that suspension has occurred. This can happen at four places: 1765 * jpeg_read_header(): will return JPEG_SUSPENDED. 1766 * jpeg_start_decompress(): will return FALSE, rather than its usual TRUE. 1767 * jpeg_read_scanlines(): will return the number of scanlines already 1768 completed (possibly 0). 1769 * jpeg_finish_decompress(): will return FALSE, rather than its usual TRUE. 1770The surrounding application must recognize these cases, load more data into 1771the input buffer, and repeat the call. In the case of jpeg_read_scanlines(), 1772increment the passed pointers past any scanlines successfully read. 1773 1774Just as with compression, the decompressor will typically backtrack to a 1775convenient restart point before suspending. When fill_input_buffer() is 1776called, next_input_byte/bytes_in_buffer point to the current restart point, 1777which is where the decompressor will backtrack to if FALSE is returned. 1778The data beyond that position must NOT be discarded if you suspend; it needs 1779to be re-read upon resumption. In most implementations, you'll need to shift 1780this data down to the start of your work buffer and then load more data after 1781it. Again, this behavior means that a several-Kbyte work buffer is essential 1782for decent performance; furthermore, you should load a reasonable amount of 1783new data before resuming decompression. (If you loaded, say, only one new 1784byte each time around, you could waste a LOT of cycles.) 1785 1786The skip_input_data() source manager routine requires special care in a 1787suspension scenario. This routine is NOT granted the ability to suspend the 1788decompressor; it can decrement bytes_in_buffer to zero, but no more. If the 1789requested skip distance exceeds the amount of data currently in the input 1790buffer, then skip_input_data() must set bytes_in_buffer to zero and record the 1791additional skip distance somewhere else. The decompressor will immediately 1792call fill_input_buffer(), which should return FALSE, which will cause a 1793suspension return. The surrounding application must then arrange to discard 1794the recorded number of bytes before it resumes loading the input buffer. 1795(Yes, this design is rather baroque, but it avoids complexity in the far more 1796common case where a non-suspending source manager is used.) 1797 1798If the input data has been exhausted, we recommend that you emit a warning 1799and insert dummy EOI markers just as a non-suspending data source manager 1800would do. This can be handled either in the surrounding application logic or 1801within fill_input_buffer(); the latter is probably more efficient. If 1802fill_input_buffer() knows that no more data is available, it can set the 1803pointer/count to point to a dummy EOI marker and then return TRUE just as 1804though it had read more data in a non-suspending situation. 1805 1806The decompressor does not attempt to suspend within standard JPEG markers; 1807instead it will backtrack to the start of the marker and reprocess the whole 1808marker next time. Hence the input buffer must be large enough to hold the 1809longest standard marker in the file. Standard JPEG markers should normally 1810not exceed a few hundred bytes each (DHT tables are typically the longest). 1811We recommend at least a 2K buffer for performance reasons, which is much 1812larger than any correct marker is likely to be. For robustness against 1813damaged marker length counts, you may wish to insert a test in your 1814application for the case that the input buffer is completely full and yet 1815the decoder has suspended without consuming any data --- otherwise, if this 1816situation did occur, it would lead to an endless loop. (The library can't 1817provide this test since it has no idea whether "the buffer is full", or 1818even whether there is a fixed-size input buffer.) 1819 1820The input buffer would need to be 64K to allow for arbitrary COM or APPn 1821markers, but these are handled specially: they are either saved into allocated 1822memory, or skipped over by calling skip_input_data(). In the former case, 1823suspension is handled correctly, and in the latter case, the problem of 1824buffer overrun is placed on skip_input_data's shoulders, as explained above. 1825Note that if you provide your own marker handling routine for large markers, 1826you should consider how to deal with buffer overflow. 1827 1828Multiple-buffer management: 1829 1830In some applications it is desirable to store the compressed data in a linked 1831list of buffer areas, so as to avoid data copying. This can be handled by 1832having empty_output_buffer() or fill_input_buffer() set the pointer and count 1833to reference the next available buffer; FALSE is returned only if no more 1834buffers are available. Although seemingly straightforward, there is a 1835pitfall in this approach: the backtrack that occurs when FALSE is returned 1836could back up into an earlier buffer. For example, when fill_input_buffer() 1837is called, the current pointer & count indicate the backtrack restart point. 1838Since fill_input_buffer() will set the pointer and count to refer to a new 1839buffer, the restart position must be saved somewhere else. Suppose a second 1840call to fill_input_buffer() occurs in the same library call, and no 1841additional input data is available, so fill_input_buffer must return FALSE. 1842If the JPEG library has not moved the pointer/count forward in the current 1843buffer, then *the correct restart point is the saved position in the prior 1844buffer*. Prior buffers may be discarded only after the library establishes 1845a restart point within a later buffer. Similar remarks apply for output into 1846a chain of buffers. 1847 1848The library will never attempt to backtrack over a skip_input_data() call, 1849so any skipped data can be permanently discarded. You still have to deal 1850with the case of skipping not-yet-received data, however. 1851 1852It's much simpler to use only a single buffer; when fill_input_buffer() is 1853called, move any unconsumed data (beyond the current pointer/count) down to 1854the beginning of this buffer and then load new data into the remaining buffer 1855space. This approach requires a little more data copying but is far easier 1856to get right. 1857 1858 1859Progressive JPEG support 1860------------------------ 1861 1862Progressive JPEG rearranges the stored data into a series of scans of 1863increasing quality. In situations where a JPEG file is transmitted across a 1864slow communications link, a decoder can generate a low-quality image very 1865quickly from the first scan, then gradually improve the displayed quality as 1866more scans are received. The final image after all scans are complete is 1867identical to that of a regular (sequential) JPEG file of the same quality 1868setting. Progressive JPEG files are often slightly smaller than equivalent 1869sequential JPEG files, but the possibility of incremental display is the main 1870reason for using progressive JPEG. 1871 1872The IJG encoder library generates progressive JPEG files when given a 1873suitable "scan script" defining how to divide the data into scans. 1874Creation of progressive JPEG files is otherwise transparent to the encoder. 1875Progressive JPEG files can also be read transparently by the decoder library. 1876If the decoding application simply uses the library as defined above, it 1877will receive a final decoded image without any indication that the file was 1878progressive. Of course, this approach does not allow incremental display. 1879To perform incremental display, an application needs to use the decoder 1880library's "buffered-image" mode, in which it receives a decoded image 1881multiple times. 1882 1883Each displayed scan requires about as much work to decode as a full JPEG 1884image of the same size, so the decoder must be fairly fast in relation to the 1885data transmission rate in order to make incremental display useful. However, 1886it is possible to skip displaying the image and simply add the incoming bits 1887to the decoder's coefficient buffer. This is fast because only Huffman 1888decoding need be done, not IDCT, upsampling, colorspace conversion, etc. 1889The IJG decoder library allows the application to switch dynamically between 1890displaying the image and simply absorbing the incoming bits. A properly 1891coded application can automatically adapt the number of display passes to 1892suit the time available as the image is received. Also, a final 1893higher-quality display cycle can be performed from the buffered data after 1894the end of the file is reached. 1895 1896Progressive compression: 1897 1898To create a progressive JPEG file (or a multiple-scan sequential JPEG file), 1899set the scan_info cinfo field to point to an array of scan descriptors, and 1900perform compression as usual. Instead of constructing your own scan list, 1901you can call the jpeg_simple_progression() helper routine to create a 1902recommended progression sequence; this method should be used by all 1903applications that don't want to get involved in the nitty-gritty of 1904progressive scan sequence design. (If you want to provide user control of 1905scan sequences, you may wish to borrow the scan script reading code found 1906in rdswitch.c, so that you can read scan script files just like cjpeg's.) 1907When scan_info is not NULL, the compression library will store DCT'd data 1908into a buffer array as jpeg_write_scanlines() is called, and will emit all 1909the requested scans during jpeg_finish_compress(). This implies that 1910multiple-scan output cannot be created with a suspending data destination 1911manager, since jpeg_finish_compress() does not support suspension. We 1912should also note that the compressor currently forces Huffman optimization 1913mode when creating a progressive JPEG file, because the default Huffman 1914tables are unsuitable for progressive files. 1915 1916Progressive decompression: 1917 1918When buffered-image mode is not used, the decoder library will read all of 1919a multi-scan file during jpeg_start_decompress(), so that it can provide a 1920final decoded image. (Here "multi-scan" means either progressive or 1921multi-scan sequential.) This makes multi-scan files transparent to the 1922decoding application. However, existing applications that used suspending 1923input with version 5 of the IJG library will need to be modified to check 1924for a suspension return from jpeg_start_decompress(). 1925 1926To perform incremental display, an application must use the library's 1927buffered-image mode. This is described in the next section. 1928 1929 1930Buffered-image mode 1931------------------- 1932 1933In buffered-image mode, the library stores the partially decoded image in a 1934coefficient buffer, from which it can be read out as many times as desired. 1935This mode is typically used for incremental display of progressive JPEG files, 1936but it can be used with any JPEG file. Each scan of a progressive JPEG file 1937adds more data (more detail) to the buffered image. The application can 1938display in lockstep with the source file (one display pass per input scan), 1939or it can allow input processing to outrun display processing. By making 1940input and display processing run independently, it is possible for the 1941application to adapt progressive display to a wide range of data transmission 1942rates. 1943 1944The basic control flow for buffered-image decoding is 1945 1946 jpeg_create_decompress() 1947 set data source 1948 jpeg_read_header() 1949 set overall decompression parameters 1950 cinfo.buffered_image = TRUE; /* select buffered-image mode */ 1951 jpeg_start_decompress() 1952 for (each output pass) { 1953 adjust output decompression parameters if required 1954 jpeg_start_output() /* start a new output pass */ 1955 for (all scanlines in image) { 1956 jpeg_read_scanlines() 1957 display scanlines 1958 } 1959 jpeg_finish_output() /* terminate output pass */ 1960 } 1961 jpeg_finish_decompress() 1962 jpeg_destroy_decompress() 1963 1964This differs from ordinary unbuffered decoding in that there is an additional 1965level of looping. The application can choose how many output passes to make 1966and how to display each pass. 1967 1968The simplest approach to displaying progressive images is to do one display 1969pass for each scan appearing in the input file. In this case the outer loop 1970condition is typically 1971 while (! jpeg_input_complete(&cinfo)) 1972and the start-output call should read 1973 jpeg_start_output(&cinfo, cinfo.input_scan_number); 1974The second parameter to jpeg_start_output() indicates which scan of the input 1975file is to be displayed; the scans are numbered starting at 1 for this 1976purpose. (You can use a loop counter starting at 1 if you like, but using 1977the library's input scan counter is easier.) The library automatically reads 1978data as necessary to complete each requested scan, and jpeg_finish_output() 1979advances to the next scan or end-of-image marker (hence input_scan_number 1980will be incremented by the time control arrives back at jpeg_start_output()). 1981With this technique, data is read from the input file only as needed, and 1982input and output processing run in lockstep. 1983 1984After reading the final scan and reaching the end of the input file, the 1985buffered image remains available; it can be read additional times by 1986repeating the jpeg_start_output()/jpeg_read_scanlines()/jpeg_finish_output() 1987sequence. For example, a useful technique is to use fast one-pass color 1988quantization for display passes made while the image is arriving, followed by 1989a final display pass using two-pass quantization for highest quality. This 1990is done by changing the library parameters before the final output pass. 1991Changing parameters between passes is discussed in detail below. 1992 1993In general the last scan of a progressive file cannot be recognized as such 1994until after it is read, so a post-input display pass is the best approach if 1995you want special processing in the final pass. 1996 1997When done with the image, be sure to call jpeg_finish_decompress() to release 1998the buffered image (or just use jpeg_destroy_decompress()). 1999 2000If input data arrives faster than it can be displayed, the application can 2001cause the library to decode input data in advance of what's needed to produce 2002output. This is done by calling the routine jpeg_consume_input(). 2003The return value is one of the following: 2004 JPEG_REACHED_SOS: reached an SOS marker (the start of a new scan) 2005 JPEG_REACHED_EOI: reached the EOI marker (end of image) 2006 JPEG_ROW_COMPLETED: completed reading one MCU row of compressed data 2007 JPEG_SCAN_COMPLETED: completed reading last MCU row of current scan 2008 JPEG_SUSPENDED: suspended before completing any of the above 2009(JPEG_SUSPENDED can occur only if a suspending data source is used.) This 2010routine can be called at any time after initializing the JPEG object. It 2011reads some additional data and returns when one of the indicated significant 2012events occurs. (If called after the EOI marker is reached, it will 2013immediately return JPEG_REACHED_EOI without attempting to read more data.) 2014 2015The library's output processing will automatically call jpeg_consume_input() 2016whenever the output processing overtakes the input; thus, simple lockstep 2017display requires no direct calls to jpeg_consume_input(). But by adding 2018calls to jpeg_consume_input(), you can absorb data in advance of what is 2019being displayed. This has two benefits: 2020 * You can limit buildup of unprocessed data in your input buffer. 2021 * You can eliminate extra display passes by paying attention to the 2022 state of the library's input processing. 2023 2024The first of these benefits only requires interspersing calls to 2025jpeg_consume_input() with your display operations and any other processing 2026you may be doing. To avoid wasting cycles due to backtracking, it's best to 2027call jpeg_consume_input() only after a hundred or so new bytes have arrived. 2028This is discussed further under "I/O suspension", above. (Note: the JPEG 2029library currently is not thread-safe. You must not call jpeg_consume_input() 2030from one thread of control if a different library routine is working on the 2031same JPEG object in another thread.) 2032 2033When input arrives fast enough that more than one new scan is available 2034before you start a new output pass, you may as well skip the output pass 2035corresponding to the completed scan. This occurs for free if you pass 2036cinfo.input_scan_number as the target scan number to jpeg_start_output(). 2037The input_scan_number field is simply the index of the scan currently being 2038consumed by the input processor. You can ensure that this is up-to-date by 2039emptying the input buffer just before calling jpeg_start_output(): call 2040jpeg_consume_input() repeatedly until it returns JPEG_SUSPENDED or 2041JPEG_REACHED_EOI. 2042 2043The target scan number passed to jpeg_start_output() is saved in the 2044cinfo.output_scan_number field. The library's output processing calls 2045jpeg_consume_input() whenever the current input scan number and row within 2046that scan is less than or equal to the current output scan number and row. 2047Thus, input processing can "get ahead" of the output processing but is not 2048allowed to "fall behind". You can achieve several different effects by 2049manipulating this interlock rule. For example, if you pass a target scan 2050number greater than the current input scan number, the output processor will 2051wait until that scan starts to arrive before producing any output. (To avoid 2052an infinite loop, the target scan number is automatically reset to the last 2053scan number when the end of image is reached. Thus, if you specify a large 2054target scan number, the library will just absorb the entire input file and 2055then perform an output pass. This is effectively the same as what 2056jpeg_start_decompress() does when you don't select buffered-image mode.) 2057When you pass a target scan number equal to the current input scan number, 2058the image is displayed no faster than the current input scan arrives. The 2059final possibility is to pass a target scan number less than the current input 2060scan number; this disables the input/output interlock and causes the output 2061processor to simply display whatever it finds in the image buffer, without 2062waiting for input. (However, the library will not accept a target scan 2063number less than one, so you can't avoid waiting for the first scan.) 2064 2065When data is arriving faster than the output display processing can advance 2066through the image, jpeg_consume_input() will store data into the buffered 2067image beyond the point at which the output processing is reading data out 2068again. If the input arrives fast enough, it may "wrap around" the buffer to 2069the point where the input is more than one whole scan ahead of the output. 2070If the output processing simply proceeds through its display pass without 2071paying attention to the input, the effect seen on-screen is that the lower 2072part of the image is one or more scans better in quality than the upper part. 2073Then, when the next output scan is started, you have a choice of what target 2074scan number to use. The recommended choice is to use the current input scan 2075number at that time, which implies that you've skipped the output scans 2076corresponding to the input scans that were completed while you processed the 2077previous output scan. In this way, the decoder automatically adapts its 2078speed to the arriving data, by skipping output scans as necessary to keep up 2079with the arriving data. 2080 2081When using this strategy, you'll want to be sure that you perform a final 2082output pass after receiving all the data; otherwise your last display may not 2083be full quality across the whole screen. So the right outer loop logic is 2084something like this: 2085 do { 2086 absorb any waiting input by calling jpeg_consume_input() 2087 final_pass = jpeg_input_complete(&cinfo); 2088 adjust output decompression parameters if required 2089 jpeg_start_output(&cinfo, cinfo.input_scan_number); 2090 ... 2091 jpeg_finish_output() 2092 } while (! final_pass); 2093rather than quitting as soon as jpeg_input_complete() returns TRUE. This 2094arrangement makes it simple to use higher-quality decoding parameters 2095for the final pass. But if you don't want to use special parameters for 2096the final pass, the right loop logic is like this: 2097 for (;;) { 2098 absorb any waiting input by calling jpeg_consume_input() 2099 jpeg_start_output(&cinfo, cinfo.input_scan_number); 2100 ... 2101 jpeg_finish_output() 2102 if (jpeg_input_complete(&cinfo) && 2103 cinfo.input_scan_number == cinfo.output_scan_number) 2104 break; 2105 } 2106In this case you don't need to know in advance whether an output pass is to 2107be the last one, so it's not necessary to have reached EOF before starting 2108the final output pass; rather, what you want to test is whether the output 2109pass was performed in sync with the final input scan. This form of the loop 2110will avoid an extra output pass whenever the decoder is able (or nearly able) 2111to keep up with the incoming data. 2112 2113When the data transmission speed is high, you might begin a display pass, 2114then find that much or all of the file has arrived before you can complete 2115the pass. (You can detect this by noting the JPEG_REACHED_EOI return code 2116from jpeg_consume_input(), or equivalently by testing jpeg_input_complete().) 2117In this situation you may wish to abort the current display pass and start a 2118new one using the newly arrived information. To do so, just call 2119jpeg_finish_output() and then start a new pass with jpeg_start_output(). 2120 2121A variant strategy is to abort and restart display if more than one complete 2122scan arrives during an output pass; this can be detected by noting 2123JPEG_REACHED_SOS returns and/or examining cinfo.input_scan_number. This 2124idea should be employed with caution, however, since the display process 2125might never get to the bottom of the image before being aborted, resulting 2126in the lower part of the screen being several passes worse than the upper. 2127In most cases it's probably best to abort an output pass only if the whole 2128file has arrived and you want to begin the final output pass immediately. 2129 2130When receiving data across a communication link, we recommend always using 2131the current input scan number for the output target scan number; if a 2132higher-quality final pass is to be done, it should be started (aborting any 2133incomplete output pass) as soon as the end of file is received. However, 2134many other strategies are possible. For example, the application can examine 2135the parameters of the current input scan and decide whether to display it or 2136not. If the scan contains only chroma data, one might choose not to use it 2137as the target scan, expecting that the scan will be small and will arrive 2138quickly. To skip to the next scan, call jpeg_consume_input() until it 2139returns JPEG_REACHED_SOS or JPEG_REACHED_EOI. Or just use the next higher 2140number as the target scan for jpeg_start_output(); but that method doesn't 2141let you inspect the next scan's parameters before deciding to display it. 2142 2143 2144In buffered-image mode, jpeg_start_decompress() never performs input and 2145thus never suspends. An application that uses input suspension with 2146buffered-image mode must be prepared for suspension returns from these 2147routines: 2148* jpeg_start_output() performs input only if you request 2-pass quantization 2149 and the target scan isn't fully read yet. (This is discussed below.) 2150* jpeg_read_scanlines(), as always, returns the number of scanlines that it 2151 was able to produce before suspending. 2152* jpeg_finish_output() will read any markers following the target scan, 2153 up to the end of the file or the SOS marker that begins another scan. 2154 (But it reads no input if jpeg_consume_input() has already reached the 2155 end of the file or a SOS marker beyond the target output scan.) 2156* jpeg_finish_decompress() will read until the end of file, and thus can 2157 suspend if the end hasn't already been reached (as can be tested by 2158 calling jpeg_input_complete()). 2159jpeg_start_output(), jpeg_finish_output(), and jpeg_finish_decompress() 2160all return TRUE if they completed their tasks, FALSE if they had to suspend. 2161In the event of a FALSE return, the application must load more input data 2162and repeat the call. Applications that use non-suspending data sources need 2163not check the return values of these three routines. 2164 2165 2166It is possible to change decoding parameters between output passes in the 2167buffered-image mode. The decoder library currently supports only very 2168limited changes of parameters. ONLY THE FOLLOWING parameter changes are 2169allowed after jpeg_start_decompress() is called: 2170* dct_method can be changed before each call to jpeg_start_output(). 2171 For example, one could use a fast DCT method for early scans, changing 2172 to a higher quality method for the final scan. 2173* dither_mode can be changed before each call to jpeg_start_output(); 2174 of course this has no impact if not using color quantization. Typically 2175 one would use ordered dither for initial passes, then switch to 2176 Floyd-Steinberg dither for the final pass. Caution: changing dither mode 2177 can cause more memory to be allocated by the library. Although the amount 2178 of memory involved is not large (a scanline or so), it may cause the 2179 initial max_memory_to_use specification to be exceeded, which in the worst 2180 case would result in an out-of-memory failure. 2181* do_block_smoothing can be changed before each call to jpeg_start_output(). 2182 This setting is relevant only when decoding a progressive JPEG image. 2183 During the first DC-only scan, block smoothing provides a very "fuzzy" look 2184 instead of the very "blocky" look seen without it; which is better seems a 2185 matter of personal taste. But block smoothing is nearly always a win 2186 during later stages, especially when decoding a successive-approximation 2187 image: smoothing helps to hide the slight blockiness that otherwise shows 2188 up on smooth gradients until the lowest coefficient bits are sent. 2189* Color quantization mode can be changed under the rules described below. 2190 You *cannot* change between full-color and quantized output (because that 2191 would alter the required I/O buffer sizes), but you can change which 2192 quantization method is used. 2193 2194When generating color-quantized output, changing quantization method is a 2195very useful way of switching between high-speed and high-quality display. 2196The library allows you to change among its three quantization methods: 21971. Single-pass quantization to a fixed color cube. 2198 Selected by cinfo.two_pass_quantize = FALSE and cinfo.colormap = NULL. 21992. Single-pass quantization to an application-supplied colormap. 2200 Selected by setting cinfo.colormap to point to the colormap (the value of 2201 two_pass_quantize is ignored); also set cinfo.actual_number_of_colors. 22023. Two-pass quantization to a colormap chosen specifically for the image. 2203 Selected by cinfo.two_pass_quantize = TRUE and cinfo.colormap = NULL. 2204 (This is the default setting selected by jpeg_read_header, but it is 2205 probably NOT what you want for the first pass of progressive display!) 2206These methods offer successively better quality and lesser speed. However, 2207only the first method is available for quantizing in non-RGB color spaces. 2208 2209IMPORTANT: because the different quantizer methods have very different 2210working-storage requirements, the library requires you to indicate which 2211one(s) you intend to use before you call jpeg_start_decompress(). (If we did 2212not require this, the max_memory_to_use setting would be a complete fiction.) 2213You do this by setting one or more of these three cinfo fields to TRUE: 2214 enable_1pass_quant Fixed color cube colormap 2215 enable_external_quant Externally-supplied colormap 2216 enable_2pass_quant Two-pass custom colormap 2217All three are initialized FALSE by jpeg_read_header(). But 2218jpeg_start_decompress() automatically sets TRUE the one selected by the 2219current two_pass_quantize and colormap settings, so you only need to set the 2220enable flags for any other quantization methods you plan to change to later. 2221 2222After setting the enable flags correctly at jpeg_start_decompress() time, you 2223can change to any enabled quantization method by setting two_pass_quantize 2224and colormap properly just before calling jpeg_start_output(). The following 2225special rules apply: 22261. You must explicitly set cinfo.colormap to NULL when switching to 1-pass 2227 or 2-pass mode from a different mode, or when you want the 2-pass 2228 quantizer to be re-run to generate a new colormap. 22292. To switch to an external colormap, or to change to a different external 2230 colormap than was used on the prior pass, you must call 2231 jpeg_new_colormap() after setting cinfo.colormap. 2232NOTE: if you want to use the same colormap as was used in the prior pass, 2233you should not do either of these things. This will save some nontrivial 2234switchover costs. 2235(These requirements exist because cinfo.colormap will always be non-NULL 2236after completing a prior output pass, since both the 1-pass and 2-pass 2237quantizers set it to point to their output colormaps. Thus you have to 2238do one of these two things to notify the library that something has changed. 2239Yup, it's a bit klugy, but it's necessary to do it this way for backwards 2240compatibility.) 2241 2242Note that in buffered-image mode, the library generates any requested colormap 2243during jpeg_start_output(), not during jpeg_start_decompress(). 2244 2245When using two-pass quantization, jpeg_start_output() makes a pass over the 2246buffered image to determine the optimum color map; it therefore may take a 2247significant amount of time, whereas ordinarily it does little work. The 2248progress monitor hook is called during this pass, if defined. It is also 2249important to realize that if the specified target scan number is greater than 2250or equal to the current input scan number, jpeg_start_output() will attempt 2251to consume input as it makes this pass. If you use a suspending data source, 2252you need to check for a FALSE return from jpeg_start_output() under these 2253conditions. The combination of 2-pass quantization and a not-yet-fully-read 2254target scan is the only case in which jpeg_start_output() will consume input. 2255 2256 2257Application authors who support buffered-image mode may be tempted to use it 2258for all JPEG images, even single-scan ones. This will work, but it is 2259inefficient: there is no need to create an image-sized coefficient buffer for 2260single-scan images. Requesting buffered-image mode for such an image wastes 2261memory. Worse, it can cost time on large images, since the buffered data has 2262to be swapped out or written to a temporary file. If you are concerned about 2263maximum performance on baseline JPEG files, you should use buffered-image 2264mode only when the incoming file actually has multiple scans. This can be 2265tested by calling jpeg_has_multiple_scans(), which will return a correct 2266result at any time after jpeg_read_header() completes. 2267 2268It is also worth noting that when you use jpeg_consume_input() to let input 2269processing get ahead of output processing, the resulting pattern of access to 2270the coefficient buffer is quite nonsequential. It's best to use the memory 2271manager jmemnobs.c if you can (ie, if you have enough real or virtual main 2272memory). If not, at least make sure that max_memory_to_use is set as high as 2273possible. If the JPEG memory manager has to use a temporary file, you will 2274probably see a lot of disk traffic and poor performance. (This could be 2275improved with additional work on the memory manager, but we haven't gotten 2276around to it yet.) 2277 2278In some applications it may be convenient to use jpeg_consume_input() for all 2279input processing, including reading the initial markers; that is, you may 2280wish to call jpeg_consume_input() instead of jpeg_read_header() during 2281startup. This works, but note that you must check for JPEG_REACHED_SOS and 2282JPEG_REACHED_EOI return codes as the equivalent of jpeg_read_header's codes. 2283Once the first SOS marker has been reached, you must call 2284jpeg_start_decompress() before jpeg_consume_input() will consume more input; 2285it'll just keep returning JPEG_REACHED_SOS until you do. If you read a 2286tables-only file this way, jpeg_consume_input() will return JPEG_REACHED_EOI 2287without ever returning JPEG_REACHED_SOS; be sure to check for this case. 2288If this happens, the decompressor will not read any more input until you call 2289jpeg_abort() to reset it. It is OK to call jpeg_consume_input() even when not 2290using buffered-image mode, but in that case it's basically a no-op after the 2291initial markers have been read: it will just return JPEG_SUSPENDED. 2292 2293 2294Abbreviated datastreams and multiple images 2295------------------------------------------- 2296 2297A JPEG compression or decompression object can be reused to process multiple 2298images. This saves a small amount of time per image by eliminating the 2299"create" and "destroy" operations, but that isn't the real purpose of the 2300feature. Rather, reuse of an object provides support for abbreviated JPEG 2301datastreams. Object reuse can also simplify processing a series of images in 2302a single input or output file. This section explains these features. 2303 2304A JPEG file normally contains several hundred bytes worth of quantization 2305and Huffman tables. In a situation where many images will be stored or 2306transmitted with identical tables, this may represent an annoying overhead. 2307The JPEG standard therefore permits tables to be omitted. The standard 2308defines three classes of JPEG datastreams: 2309 * "Interchange" datastreams contain an image and all tables needed to decode 2310 the image. These are the usual kind of JPEG file. 2311 * "Abbreviated image" datastreams contain an image, but are missing some or 2312 all of the tables needed to decode that image. 2313 * "Abbreviated table specification" (henceforth "tables-only") datastreams 2314 contain only table specifications. 2315To decode an abbreviated image, it is necessary to load the missing table(s) 2316into the decoder beforehand. This can be accomplished by reading a separate 2317tables-only file. A variant scheme uses a series of images in which the first 2318image is an interchange (complete) datastream, while subsequent ones are 2319abbreviated and rely on the tables loaded by the first image. It is assumed 2320that once the decoder has read a table, it will remember that table until a 2321new definition for the same table number is encountered. 2322 2323It is the application designer's responsibility to figure out how to associate 2324the correct tables with an abbreviated image. While abbreviated datastreams 2325can be useful in a closed environment, their use is strongly discouraged in 2326any situation where data exchange with other applications might be needed. 2327Caveat designer. 2328 2329The JPEG library provides support for reading and writing any combination of 2330tables-only datastreams and abbreviated images. In both compression and 2331decompression objects, a quantization or Huffman table will be retained for 2332the lifetime of the object, unless it is overwritten by a new table definition. 2333 2334 2335To create abbreviated image datastreams, it is only necessary to tell the 2336compressor not to emit some or all of the tables it is using. Each 2337quantization and Huffman table struct contains a boolean field "sent_table", 2338which normally is initialized to FALSE. For each table used by the image, the 2339header-writing process emits the table and sets sent_table = TRUE unless it is 2340already TRUE. (In normal usage, this prevents outputting the same table 2341definition multiple times, as would otherwise occur because the chroma 2342components typically share tables.) Thus, setting this field to TRUE before 2343calling jpeg_start_compress() will prevent the table from being written at 2344all. 2345 2346If you want to create a "pure" abbreviated image file containing no tables, 2347just call "jpeg_suppress_tables(&cinfo, TRUE)" after constructing all the 2348tables. If you want to emit some but not all tables, you'll need to set the 2349individual sent_table fields directly. 2350 2351To create an abbreviated image, you must also call jpeg_start_compress() 2352with a second parameter of FALSE, not TRUE. Otherwise jpeg_start_compress() 2353will force all the sent_table fields to FALSE. (This is a safety feature to 2354prevent abbreviated images from being created accidentally.) 2355 2356To create a tables-only file, perform the same parameter setup that you 2357normally would, but instead of calling jpeg_start_compress() and so on, call 2358jpeg_write_tables(&cinfo). This will write an abbreviated datastream 2359containing only SOI, DQT and/or DHT markers, and EOI. All the quantization 2360and Huffman tables that are currently defined in the compression object will 2361be emitted unless their sent_tables flag is already TRUE, and then all the 2362sent_tables flags will be set TRUE. 2363 2364A sure-fire way to create matching tables-only and abbreviated image files 2365is to proceed as follows: 2366 2367 create JPEG compression object 2368 set JPEG parameters 2369 set destination to tables-only file 2370 jpeg_write_tables(&cinfo); 2371 set destination to image file 2372 jpeg_start_compress(&cinfo, FALSE); 2373 write data... 2374 jpeg_finish_compress(&cinfo); 2375 2376Since the JPEG parameters are not altered between writing the table file and 2377the abbreviated image file, the same tables are sure to be used. Of course, 2378you can repeat the jpeg_start_compress() ... jpeg_finish_compress() sequence 2379many times to produce many abbreviated image files matching the table file. 2380 2381You cannot suppress output of the computed Huffman tables when Huffman 2382optimization is selected. (If you could, there'd be no way to decode the 2383image...) Generally, you don't want to set optimize_coding = TRUE when 2384you are trying to produce abbreviated files. 2385 2386In some cases you might want to compress an image using tables which are 2387not stored in the application, but are defined in an interchange or 2388tables-only file readable by the application. This can be done by setting up 2389a JPEG decompression object to read the specification file, then copying the 2390tables into your compression object. See jpeg_copy_critical_parameters() 2391for an example of copying quantization tables. 2392 2393 2394To read abbreviated image files, you simply need to load the proper tables 2395into the decompression object before trying to read the abbreviated image. 2396If the proper tables are stored in the application program, you can just 2397allocate the table structs and fill in their contents directly. For example, 2398to load a fixed quantization table into table slot "n": 2399 2400 if (cinfo.quant_tbl_ptrs[n] == NULL) 2401 cinfo.quant_tbl_ptrs[n] = jpeg_alloc_quant_table((j_common_ptr) &cinfo); 2402 quant_ptr = cinfo.quant_tbl_ptrs[n]; /* quant_ptr is JQUANT_TBL* */ 2403 for (i = 0; i < 64; i++) { 2404 /* Qtable[] is desired quantization table, in natural array order */ 2405 quant_ptr->quantval[i] = Qtable[i]; 2406 } 2407 2408Code to load a fixed Huffman table is typically (for AC table "n"): 2409 2410 if (cinfo.ac_huff_tbl_ptrs[n] == NULL) 2411 cinfo.ac_huff_tbl_ptrs[n] = jpeg_alloc_huff_table((j_common_ptr) &cinfo); 2412 huff_ptr = cinfo.ac_huff_tbl_ptrs[n]; /* huff_ptr is JHUFF_TBL* */ 2413 for (i = 1; i <= 16; i++) { 2414 /* counts[i] is number of Huffman codes of length i bits, i=1..16 */ 2415 huff_ptr->bits[i] = counts[i]; 2416 } 2417 for (i = 0; i < 256; i++) { 2418 /* symbols[] is the list of Huffman symbols, in code-length order */ 2419 huff_ptr->huffval[i] = symbols[i]; 2420 } 2421 2422(Note that trying to set cinfo.quant_tbl_ptrs[n] to point directly at a 2423constant JQUANT_TBL object is not safe. If the incoming file happened to 2424contain a quantization table definition, your master table would get 2425overwritten! Instead allocate a working table copy and copy the master table 2426into it, as illustrated above. Ditto for Huffman tables, of course.) 2427 2428You might want to read the tables from a tables-only file, rather than 2429hard-wiring them into your application. The jpeg_read_header() call is 2430sufficient to read a tables-only file. You must pass a second parameter of 2431FALSE to indicate that you do not require an image to be present. Thus, the 2432typical scenario is 2433 2434 create JPEG decompression object 2435 set source to tables-only file 2436 jpeg_read_header(&cinfo, FALSE); 2437 set source to abbreviated image file 2438 jpeg_read_header(&cinfo, TRUE); 2439 set decompression parameters 2440 jpeg_start_decompress(&cinfo); 2441 read data... 2442 jpeg_finish_decompress(&cinfo); 2443 2444In some cases, you may want to read a file without knowing whether it contains 2445an image or just tables. In that case, pass FALSE and check the return value 2446from jpeg_read_header(): it will be JPEG_HEADER_OK if an image was found, 2447JPEG_HEADER_TABLES_ONLY if only tables were found. (A third return value, 2448JPEG_SUSPENDED, is possible when using a suspending data source manager.) 2449Note that jpeg_read_header() will not complain if you read an abbreviated 2450image for which you haven't loaded the missing tables; the missing-table check 2451occurs later, in jpeg_start_decompress(). 2452 2453 2454It is possible to read a series of images from a single source file by 2455repeating the jpeg_read_header() ... jpeg_finish_decompress() sequence, 2456without releasing/recreating the JPEG object or the data source module. 2457(If you did reinitialize, any partial bufferload left in the data source 2458buffer at the end of one image would be discarded, causing you to lose the 2459start of the next image.) When you use this method, stored tables are 2460automatically carried forward, so some of the images can be abbreviated images 2461that depend on tables from earlier images. 2462 2463If you intend to write a series of images into a single destination file, 2464you might want to make a specialized data destination module that doesn't 2465flush the output buffer at term_destination() time. This would speed things 2466up by some trifling amount. Of course, you'd need to remember to flush the 2467buffer after the last image. You can make the later images be abbreviated 2468ones by passing FALSE to jpeg_start_compress(). 2469 2470 2471Special markers 2472--------------- 2473 2474Some applications may need to insert or extract special data in the JPEG 2475datastream. The JPEG standard provides marker types "COM" (comment) and 2476"APP0" through "APP15" (application) to hold application-specific data. 2477Unfortunately, the use of these markers is not specified by the standard. 2478COM markers are fairly widely used to hold user-supplied text. The JFIF file 2479format spec uses APP0 markers with specified initial strings to hold certain 2480data. Adobe applications use APP14 markers beginning with the string "Adobe" 2481for miscellaneous data. Other APPn markers are rarely seen, but might 2482contain almost anything. 2483 2484If you wish to store user-supplied text, we recommend you use COM markers 2485and place readable 7-bit ASCII text in them. Newline conventions are not 2486standardized --- expect to find LF (Unix style), CR/LF (DOS style), or CR 2487(Mac style). A robust COM reader should be able to cope with random binary 2488garbage, including nulls, since some applications generate COM markers 2489containing non-ASCII junk. (But yours should not be one of them.) 2490 2491For program-supplied data, use an APPn marker, and be sure to begin it with an 2492identifying string so that you can tell whether the marker is actually yours. 2493It's probably best to avoid using APP0 or APP14 for any private markers. 2494(NOTE: the upcoming SPIFF standard will use APP8 markers; we recommend you 2495not use APP8 markers for any private purposes, either.) 2496 2497Keep in mind that at most 65533 bytes can be put into one marker, but you 2498can have as many markers as you like. 2499 2500By default, the IJG compression library will write a JFIF APP0 marker if the 2501selected JPEG colorspace is grayscale or YCbCr, or an Adobe APP14 marker if 2502the selected colorspace is RGB, CMYK, or YCCK. You can disable this, but 2503we don't recommend it. The decompression library will recognize JFIF and 2504Adobe markers and will set the JPEG colorspace properly when one is found. 2505 2506 2507You can write special markers immediately following the datastream header by 2508calling jpeg_write_marker() after jpeg_start_compress() and before the first 2509call to jpeg_write_scanlines(). When you do this, the markers appear after 2510the SOI and the JFIF APP0 and Adobe APP14 markers (if written), but before 2511all else. Specify the marker type parameter as "JPEG_COM" for COM or 2512"JPEG_APP0 + n" for APPn. (Actually, jpeg_write_marker will let you write 2513any marker type, but we don't recommend writing any other kinds of marker.) 2514For example, to write a user comment string pointed to by comment_text: 2515 jpeg_write_marker(cinfo, JPEG_COM, comment_text, strlen(comment_text)); 2516 2517If it's not convenient to store all the marker data in memory at once, 2518you can instead call jpeg_write_m_header() followed by multiple calls to 2519jpeg_write_m_byte(). If you do it this way, it's your responsibility to 2520call jpeg_write_m_byte() exactly the number of times given in the length 2521parameter to jpeg_write_m_header(). (This method lets you empty the 2522output buffer partway through a marker, which might be important when 2523using a suspending data destination module. In any case, if you are using 2524a suspending destination, you should flush its buffer after inserting 2525any special markers. See "I/O suspension".) 2526 2527Or, if you prefer to synthesize the marker byte sequence yourself, 2528you can just cram it straight into the data destination module. 2529 2530If you are writing JFIF 1.02 extension markers (thumbnail images), don't 2531forget to set cinfo.JFIF_minor_version = 2 so that the encoder will write the 2532correct JFIF version number in the JFIF header marker. The library's default 2533is to write version 1.01, but that's wrong if you insert any 1.02 extension 2534markers. (We could probably get away with just defaulting to 1.02, but there 2535used to be broken decoders that would complain about unknown minor version 2536numbers. To reduce compatibility risks it's safest not to write 1.02 unless 2537you are actually using 1.02 extensions.) 2538 2539 2540When reading, two methods of handling special markers are available: 25411. You can ask the library to save the contents of COM and/or APPn markers 2542into memory, and then examine them at your leisure afterwards. 25432. You can supply your own routine to process COM and/or APPn markers 2544on-the-fly as they are read. 2545The first method is simpler to use, especially if you are using a suspending 2546data source; writing a marker processor that copes with input suspension is 2547not easy (consider what happens if the marker is longer than your available 2548input buffer). However, the second method conserves memory since the marker 2549data need not be kept around after it's been processed. 2550 2551For either method, you'd normally set up marker handling after creating a 2552decompression object and before calling jpeg_read_header(), because the 2553markers of interest will typically be near the head of the file and so will 2554be scanned by jpeg_read_header. Once you've established a marker handling 2555method, it will be used for the life of that decompression object 2556(potentially many datastreams), unless you change it. Marker handling is 2557determined separately for COM markers and for each APPn marker code. 2558 2559 2560To save the contents of special markers in memory, call 2561 jpeg_save_markers(cinfo, marker_code, length_limit) 2562where marker_code is the marker type to save, JPEG_COM or JPEG_APP0+n. 2563(To arrange to save all the special marker types, you need to call this 2564routine 17 times, for COM and APP0-APP15.) If the incoming marker is longer 2565than length_limit data bytes, only length_limit bytes will be saved; this 2566parameter allows you to avoid chewing up memory when you only need to see the 2567first few bytes of a potentially large marker. If you want to save all the 2568data, set length_limit to 0xFFFF; that is enough since marker lengths are only 256916 bits. As a special case, setting length_limit to 0 prevents that marker 2570type from being saved at all. (That is the default behavior, in fact.) 2571 2572After jpeg_read_header() completes, you can examine the special markers by 2573following the cinfo->marker_list pointer chain. All the special markers in 2574the file appear in this list, in order of their occurrence in the file (but 2575omitting any markers of types you didn't ask for). Both the original data 2576length and the saved data length are recorded for each list entry; the latter 2577will not exceed length_limit for the particular marker type. Note that these 2578lengths exclude the marker length word, whereas the stored representation 2579within the JPEG file includes it. (Hence the maximum data length is really 2580only 65533.) 2581 2582It is possible that additional special markers appear in the file beyond the 2583SOS marker at which jpeg_read_header stops; if so, the marker list will be 2584extended during reading of the rest of the file. This is not expected to be 2585common, however. If you are short on memory you may want to reset the length 2586limit to zero for all marker types after finishing jpeg_read_header, to 2587ensure that the max_memory_to_use setting cannot be exceeded due to addition 2588of later markers. 2589 2590The marker list remains stored until you call jpeg_finish_decompress or 2591jpeg_abort, at which point the memory is freed and the list is set to empty. 2592(jpeg_destroy also releases the storage, of course.) 2593 2594Note that the library is internally interested in APP0 and APP14 markers; 2595if you try to set a small nonzero length limit on these types, the library 2596will silently force the length up to the minimum it wants. (But you can set 2597a zero length limit to prevent them from being saved at all.) Also, in a 259816-bit environment, the maximum length limit may be constrained to less than 259965533 by malloc() limitations. It is therefore best not to assume that the 2600effective length limit is exactly what you set it to be. 2601 2602 2603If you want to supply your own marker-reading routine, you do it by calling 2604jpeg_set_marker_processor(). A marker processor routine must have the 2605signature 2606 boolean jpeg_marker_parser_method (j_decompress_ptr cinfo) 2607Although the marker code is not explicitly passed, the routine can find it 2608in cinfo->unread_marker. At the time of call, the marker proper has been 2609read from the data source module. The processor routine is responsible for 2610reading the marker length word and the remaining parameter bytes, if any. 2611Return TRUE to indicate success. (FALSE should be returned only if you are 2612using a suspending data source and it tells you to suspend. See the standard 2613marker processors in jdmarker.c for appropriate coding methods if you need to 2614use a suspending data source.) 2615 2616If you override the default APP0 or APP14 processors, it is up to you to 2617recognize JFIF and Adobe markers if you want colorspace recognition to occur 2618properly. We recommend copying and extending the default processors if you 2619want to do that. (A better idea is to save these marker types for later 2620examination by calling jpeg_save_markers(); that method doesn't interfere 2621with the library's own processing of these markers.) 2622 2623jpeg_set_marker_processor() and jpeg_save_markers() are mutually exclusive 2624--- if you call one it overrides any previous call to the other, for the 2625particular marker type specified. 2626 2627A simple example of an external COM processor can be found in djpeg.c. 2628Also, see jpegtran.c for an example of using jpeg_save_markers. 2629 2630 2631Raw (downsampled) image data 2632---------------------------- 2633 2634Some applications need to supply already-downsampled image data to the JPEG 2635compressor, or to receive raw downsampled data from the decompressor. The 2636library supports this requirement by allowing the application to write or 2637read raw data, bypassing the normal preprocessing or postprocessing steps. 2638The interface is different from the standard one and is somewhat harder to 2639use. If your interest is merely in bypassing color conversion, we recommend 2640that you use the standard interface and simply set jpeg_color_space = 2641in_color_space (or jpeg_color_space = out_color_space for decompression). 2642The mechanism described in this section is necessary only to supply or 2643receive downsampled image data, in which not all components have the same 2644dimensions. 2645 2646 2647To compress raw data, you must supply the data in the colorspace to be used 2648in the JPEG file (please read the earlier section on Special color spaces) 2649and downsampled to the sampling factors specified in the JPEG parameters. 2650You must supply the data in the format used internally by the JPEG library, 2651namely a JSAMPIMAGE array. This is an array of pointers to two-dimensional 2652arrays, each of type JSAMPARRAY. Each 2-D array holds the values for one 2653color component. This structure is necessary since the components are of 2654different sizes. If the image dimensions are not a multiple of the MCU size, 2655you must also pad the data correctly (usually, this is done by replicating 2656the last column and/or row). The data must be padded to a multiple of a DCT 2657block in each component: that is, each downsampled row must contain a 2658multiple of 8 valid samples, and there must be a multiple of 8 sample rows 2659for each component. (For applications such as conversion of digital TV 2660images, the standard image size is usually a multiple of the DCT block size, 2661so that no padding need actually be done.) 2662 2663The procedure for compression of raw data is basically the same as normal 2664compression, except that you call jpeg_write_raw_data() in place of 2665jpeg_write_scanlines(). Before calling jpeg_start_compress(), you must do 2666the following: 2667 * Set cinfo->raw_data_in to TRUE. (It is set FALSE by jpeg_set_defaults().) 2668 This notifies the library that you will be supplying raw data. 2669 * Ensure jpeg_color_space is correct --- an explicit jpeg_set_colorspace() 2670 call is a good idea. Note that since color conversion is bypassed, 2671 in_color_space is ignored, except that jpeg_set_defaults() uses it to 2672 choose the default jpeg_color_space setting. 2673 * Ensure the sampling factors, cinfo->comp_info[i].h_samp_factor and 2674 cinfo->comp_info[i].v_samp_factor, are correct. Since these indicate the 2675 dimensions of the data you are supplying, it's wise to set them 2676 explicitly, rather than assuming the library's defaults are what you want. 2677 2678To pass raw data to the library, call jpeg_write_raw_data() in place of 2679jpeg_write_scanlines(). The two routines work similarly except that 2680jpeg_write_raw_data takes a JSAMPIMAGE data array rather than JSAMPARRAY. 2681The scanlines count passed to and returned from jpeg_write_raw_data is 2682measured in terms of the component with the largest v_samp_factor. 2683 2684jpeg_write_raw_data() processes one MCU row per call, which is to say 2685v_samp_factor*DCTSIZE sample rows of each component. The passed num_lines 2686value must be at least max_v_samp_factor*DCTSIZE, and the return value will 2687be exactly that amount (or possibly some multiple of that amount, in future 2688library versions). This is true even on the last call at the bottom of the 2689image; don't forget to pad your data as necessary. 2690 2691The required dimensions of the supplied data can be computed for each 2692component as 2693 cinfo->comp_info[i].width_in_blocks*DCTSIZE samples per row 2694 cinfo->comp_info[i].height_in_blocks*DCTSIZE rows in image 2695after jpeg_start_compress() has initialized those fields. If the valid data 2696is smaller than this, it must be padded appropriately. For some sampling 2697factors and image sizes, additional dummy DCT blocks are inserted to make 2698the image a multiple of the MCU dimensions. The library creates such dummy 2699blocks itself; it does not read them from your supplied data. Therefore you 2700need never pad by more than DCTSIZE samples. An example may help here. 2701Assume 2h2v downsampling of YCbCr data, that is 2702 cinfo->comp_info[0].h_samp_factor = 2 for Y 2703 cinfo->comp_info[0].v_samp_factor = 2 2704 cinfo->comp_info[1].h_samp_factor = 1 for Cb 2705 cinfo->comp_info[1].v_samp_factor = 1 2706 cinfo->comp_info[2].h_samp_factor = 1 for Cr 2707 cinfo->comp_info[2].v_samp_factor = 1 2708and suppose that the nominal image dimensions (cinfo->image_width and 2709cinfo->image_height) are 101x101 pixels. Then jpeg_start_compress() will 2710compute downsampled_width = 101 and width_in_blocks = 13 for Y, 2711downsampled_width = 51 and width_in_blocks = 7 for Cb and Cr (and the same 2712for the height fields). You must pad the Y data to at least 13*8 = 104 2713columns and rows, the Cb/Cr data to at least 7*8 = 56 columns and rows. The 2714MCU height is max_v_samp_factor = 2 DCT rows so you must pass at least 16 2715scanlines on each call to jpeg_write_raw_data(), which is to say 16 actual 2716sample rows of Y and 8 each of Cb and Cr. A total of 7 MCU rows are needed, 2717so you must pass a total of 7*16 = 112 "scanlines". The last DCT block row 2718of Y data is dummy, so it doesn't matter what you pass for it in the data 2719arrays, but the scanlines count must total up to 112 so that all of the Cb 2720and Cr data gets passed. 2721 2722Output suspension is supported with raw-data compression: if the data 2723destination module suspends, jpeg_write_raw_data() will return 0. 2724In this case the same data rows must be passed again on the next call. 2725 2726 2727Decompression with raw data output implies bypassing all postprocessing: 2728you cannot ask for rescaling or color quantization, for instance. More 2729seriously, you must deal with the color space and sampling factors present in 2730the incoming file. If your application only handles, say, 2h1v YCbCr data, 2731you must check for and fail on other color spaces or other sampling factors. 2732The library will not convert to a different color space for you. 2733 2734To obtain raw data output, set cinfo->raw_data_out = TRUE before 2735jpeg_start_decompress() (it is set FALSE by jpeg_read_header()). Be sure to 2736verify that the color space and sampling factors are ones you can handle. 2737Then call jpeg_read_raw_data() in place of jpeg_read_scanlines(). The 2738decompression process is otherwise the same as usual. 2739 2740jpeg_read_raw_data() returns one MCU row per call, and thus you must pass a 2741buffer of at least max_v_samp_factor*DCTSIZE scanlines (scanline counting is 2742the same as for raw-data compression). The buffer you pass must be large 2743enough to hold the actual data plus padding to DCT-block boundaries. As with 2744compression, any entirely dummy DCT blocks are not processed so you need not 2745allocate space for them, but the total scanline count includes them. The 2746above example of computing buffer dimensions for raw-data compression is 2747equally valid for decompression. 2748 2749Input suspension is supported with raw-data decompression: if the data source 2750module suspends, jpeg_read_raw_data() will return 0. You can also use 2751buffered-image mode to read raw data in multiple passes. 2752 2753 2754Really raw data: DCT coefficients 2755--------------------------------- 2756 2757It is possible to read or write the contents of a JPEG file as raw DCT 2758coefficients. This facility is mainly intended for use in lossless 2759transcoding between different JPEG file formats. Other possible applications 2760include lossless cropping of a JPEG image, lossless reassembly of a 2761multi-strip or multi-tile TIFF/JPEG file into a single JPEG datastream, etc. 2762 2763To read the contents of a JPEG file as DCT coefficients, open the file and do 2764jpeg_read_header() as usual. But instead of calling jpeg_start_decompress() 2765and jpeg_read_scanlines(), call jpeg_read_coefficients(). This will read the 2766entire image into a set of virtual coefficient-block arrays, one array per 2767component. The return value is a pointer to an array of virtual-array 2768descriptors. Each virtual array can be accessed directly using the JPEG 2769memory manager's access_virt_barray method (see Memory management, below, 2770and also read structure.txt's discussion of virtual array handling). Or, 2771for simple transcoding to a different JPEG file format, the array list can 2772just be handed directly to jpeg_write_coefficients(). 2773 2774Each block in the block arrays contains quantized coefficient values in 2775normal array order (not JPEG zigzag order). The block arrays contain only 2776DCT blocks containing real data; any entirely-dummy blocks added to fill out 2777interleaved MCUs at the right or bottom edges of the image are discarded 2778during reading and are not stored in the block arrays. (The size of each 2779block array can be determined from the width_in_blocks and height_in_blocks 2780fields of the component's comp_info entry.) This is also the data format 2781expected by jpeg_write_coefficients(). 2782 2783When you are done using the virtual arrays, call jpeg_finish_decompress() 2784to release the array storage and return the decompression object to an idle 2785state; or just call jpeg_destroy() if you don't need to reuse the object. 2786 2787If you use a suspending data source, jpeg_read_coefficients() will return 2788NULL if it is forced to suspend; a non-NULL return value indicates successful 2789completion. You need not test for a NULL return value when using a 2790non-suspending data source. 2791 2792It is also possible to call jpeg_read_coefficients() to obtain access to the 2793decoder's coefficient arrays during a normal decode cycle in buffered-image 2794mode. This frammish might be useful for progressively displaying an incoming 2795image and then re-encoding it without loss. To do this, decode in buffered- 2796image mode as discussed previously, then call jpeg_read_coefficients() after 2797the last jpeg_finish_output() call. The arrays will be available for your use 2798until you call jpeg_finish_decompress(). 2799 2800 2801To write the contents of a JPEG file as DCT coefficients, you must provide 2802the DCT coefficients stored in virtual block arrays. You can either pass 2803block arrays read from an input JPEG file by jpeg_read_coefficients(), or 2804allocate virtual arrays from the JPEG compression object and fill them 2805yourself. In either case, jpeg_write_coefficients() is substituted for 2806jpeg_start_compress() and jpeg_write_scanlines(). Thus the sequence is 2807 * Create compression object 2808 * Set all compression parameters as necessary 2809 * Request virtual arrays if needed 2810 * jpeg_write_coefficients() 2811 * jpeg_finish_compress() 2812 * Destroy or re-use compression object 2813jpeg_write_coefficients() is passed a pointer to an array of virtual block 2814array descriptors; the number of arrays is equal to cinfo.num_components. 2815 2816The virtual arrays need only have been requested, not realized, before 2817jpeg_write_coefficients() is called. A side-effect of 2818jpeg_write_coefficients() is to realize any virtual arrays that have been 2819requested from the compression object's memory manager. Thus, when obtaining 2820the virtual arrays from the compression object, you should fill the arrays 2821after calling jpeg_write_coefficients(). The data is actually written out 2822when you call jpeg_finish_compress(); jpeg_write_coefficients() only writes 2823the file header. 2824 2825When writing raw DCT coefficients, it is crucial that the JPEG quantization 2826tables and sampling factors match the way the data was encoded, or the 2827resulting file will be invalid. For transcoding from an existing JPEG file, 2828we recommend using jpeg_copy_critical_parameters(). This routine initializes 2829all the compression parameters to default values (like jpeg_set_defaults()), 2830then copies the critical information from a source decompression object. 2831The decompression object should have just been used to read the entire 2832JPEG input file --- that is, it should be awaiting jpeg_finish_decompress(). 2833 2834jpeg_write_coefficients() marks all tables stored in the compression object 2835as needing to be written to the output file (thus, it acts like 2836jpeg_start_compress(cinfo, TRUE)). This is for safety's sake, to avoid 2837emitting abbreviated JPEG files by accident. If you really want to emit an 2838abbreviated JPEG file, call jpeg_suppress_tables(), or set the tables' 2839individual sent_table flags, between calling jpeg_write_coefficients() and 2840jpeg_finish_compress(). 2841 2842 2843Progress monitoring 2844------------------- 2845 2846Some applications may need to regain control from the JPEG library every so 2847often. The typical use of this feature is to produce a percent-done bar or 2848other progress display. (For a simple example, see cjpeg.c or djpeg.c.) 2849Although you do get control back frequently during the data-transferring pass 2850(the jpeg_read_scanlines or jpeg_write_scanlines loop), any additional passes 2851will occur inside jpeg_finish_compress or jpeg_start_decompress; those 2852routines may take a long time to execute, and you don't get control back 2853until they are done. 2854 2855You can define a progress-monitor routine which will be called periodically 2856by the library. No guarantees are made about how often this call will occur, 2857so we don't recommend you use it for mouse tracking or anything like that. 2858At present, a call will occur once per MCU row, scanline, or sample row 2859group, whichever unit is convenient for the current processing mode; so the 2860wider the image, the longer the time between calls. During the data 2861transferring pass, only one call occurs per call of jpeg_read_scanlines or 2862jpeg_write_scanlines, so don't pass a large number of scanlines at once if 2863you want fine resolution in the progress count. (If you really need to use 2864the callback mechanism for time-critical tasks like mouse tracking, you could 2865insert additional calls inside some of the library's inner loops.) 2866 2867To establish a progress-monitor callback, create a struct jpeg_progress_mgr, 2868fill in its progress_monitor field with a pointer to your callback routine, 2869and set cinfo->progress to point to the struct. The callback will be called 2870whenever cinfo->progress is non-NULL. (This pointer is set to NULL by 2871jpeg_create_compress or jpeg_create_decompress; the library will not change 2872it thereafter. So if you allocate dynamic storage for the progress struct, 2873make sure it will live as long as the JPEG object does. Allocating from the 2874JPEG memory manager with lifetime JPOOL_PERMANENT will work nicely.) You 2875can use the same callback routine for both compression and decompression. 2876 2877The jpeg_progress_mgr struct contains four fields which are set by the library: 2878 long pass_counter; /* work units completed in this pass */ 2879 long pass_limit; /* total number of work units in this pass */ 2880 int completed_passes; /* passes completed so far */ 2881 int total_passes; /* total number of passes expected */ 2882During any one pass, pass_counter increases from 0 up to (not including) 2883pass_limit; the step size is usually but not necessarily 1. The pass_limit 2884value may change from one pass to another. The expected total number of 2885passes is in total_passes, and the number of passes already completed is in 2886completed_passes. Thus the fraction of work completed may be estimated as 2887 completed_passes + (pass_counter/pass_limit) 2888 -------------------------------------------- 2889 total_passes 2890ignoring the fact that the passes may not be equal amounts of work. 2891 2892When decompressing, pass_limit can even change within a pass, because it 2893depends on the number of scans in the JPEG file, which isn't always known in 2894advance. The computed fraction-of-work-done may jump suddenly (if the library 2895discovers it has overestimated the number of scans) or even decrease (in the 2896opposite case). It is not wise to put great faith in the work estimate. 2897 2898When using the decompressor's buffered-image mode, the progress monitor work 2899estimate is likely to be completely unhelpful, because the library has no way 2900to know how many output passes will be demanded of it. Currently, the library 2901sets total_passes based on the assumption that there will be one more output 2902pass if the input file end hasn't yet been read (jpeg_input_complete() isn't 2903TRUE), but no more output passes if the file end has been reached when the 2904output pass is started. This means that total_passes will rise as additional 2905output passes are requested. If you have a way of determining the input file 2906size, estimating progress based on the fraction of the file that's been read 2907will probably be more useful than using the library's value. 2908 2909 2910Memory management 2911----------------- 2912 2913This section covers some key facts about the JPEG library's built-in memory 2914manager. For more info, please read structure.txt's section about the memory 2915manager, and consult the source code if necessary. 2916 2917All memory and temporary file allocation within the library is done via the 2918memory manager. If necessary, you can replace the "back end" of the memory 2919manager to control allocation yourself (for example, if you don't want the 2920library to use malloc() and free() for some reason). 2921 2922Some data is allocated "permanently" and will not be freed until the JPEG 2923object is destroyed. Most data is allocated "per image" and is freed by 2924jpeg_finish_compress, jpeg_finish_decompress, or jpeg_abort. You can call the 2925memory manager yourself to allocate structures that will automatically be 2926freed at these times. Typical code for this is 2927 ptr = (*cinfo->mem->alloc_small) ((j_common_ptr) cinfo, JPOOL_IMAGE, size); 2928Use JPOOL_PERMANENT to get storage that lasts as long as the JPEG object. 2929Use alloc_large instead of alloc_small for anything bigger than a few Kbytes. 2930There are also alloc_sarray and alloc_barray routines that automatically 2931build 2-D sample or block arrays. 2932 2933The library's minimum space requirements to process an image depend on the 2934image's width, but not on its height, because the library ordinarily works 2935with "strip" buffers that are as wide as the image but just a few rows high. 2936Some operating modes (eg, two-pass color quantization) require full-image 2937buffers. Such buffers are treated as "virtual arrays": only the current strip 2938need be in memory, and the rest can be swapped out to a temporary file. 2939 2940If you use the simplest memory manager back end (jmemnobs.c), then no 2941temporary files are used; virtual arrays are simply malloc()'d. Images bigger 2942than memory can be processed only if your system supports virtual memory. 2943The other memory manager back ends support temporary files of various flavors 2944and thus work in machines without virtual memory. They may also be useful on 2945Unix machines if you need to process images that exceed available swap space. 2946 2947When using temporary files, the library will make the in-memory buffers for 2948its virtual arrays just big enough to stay within a "maximum memory" setting. 2949Your application can set this limit by setting cinfo->mem->max_memory_to_use 2950after creating the JPEG object. (Of course, there is still a minimum size for 2951the buffers, so the max-memory setting is effective only if it is bigger than 2952the minimum space needed.) If you allocate any large structures yourself, you 2953must allocate them before jpeg_start_compress() or jpeg_start_decompress() in 2954order to have them counted against the max memory limit. Also keep in mind 2955that space allocated with alloc_small() is ignored, on the assumption that 2956it's too small to be worth worrying about; so a reasonable safety margin 2957should be left when setting max_memory_to_use. 2958 2959 2960Memory usage 2961------------ 2962 2963Working memory requirements while performing compression or decompression 2964depend on image dimensions, image characteristics (such as colorspace and 2965JPEG process), and operating mode (application-selected options). 2966 2967As of v6b, the decompressor requires: 2968 1. About 24K in more-or-less-fixed-size data. This varies a bit depending 2969 on operating mode and image characteristics (particularly color vs. 2970 grayscale), but it doesn't depend on image dimensions. 2971 2. Strip buffers (of size proportional to the image width) for IDCT and 2972 upsampling results. The worst case for commonly used sampling factors 2973 is about 34 bytes * width in pixels for a color image. A grayscale image 2974 only needs about 8 bytes per pixel column. 2975 3. A full-image DCT coefficient buffer is needed to decode a multi-scan JPEG 2976 file (including progressive JPEGs), or whenever you select buffered-image 2977 mode. This takes 2 bytes/coefficient. At typical 2x2 sampling, that's 2978 3 bytes per pixel for a color image. Worst case (1x1 sampling) requires 2979 6 bytes/pixel. For grayscale, figure 2 bytes/pixel. 2980 4. To perform 2-pass color quantization, the decompressor also needs a 2981 128K color lookup table and a full-image pixel buffer (3 bytes/pixel). 2982This does not count any memory allocated by the application, such as a 2983buffer to hold the final output image. 2984 2985The above figures are valid for 8-bit JPEG data precision and a machine with 298632-bit ints. For 12-bit JPEG data, double the size of the strip buffers and 2987quantization pixel buffer. The "fixed-size" data will be somewhat smaller 2988with 16-bit ints, larger with 64-bit ints. Also, CMYK or other unusual 2989color spaces will require different amounts of space. 2990 2991The full-image coefficient and pixel buffers, if needed at all, do not 2992have to be fully RAM resident; you can have the library use temporary 2993files instead when the total memory usage would exceed a limit you set. 2994(But if your OS supports virtual memory, it's probably better to just use 2995jmemnobs and let the OS do the swapping.) 2996 2997The compressor's memory requirements are similar, except that it has no need 2998for color quantization. Also, it needs a full-image DCT coefficient buffer 2999if Huffman-table optimization is asked for, even if progressive mode is not 3000requested. 3001 3002If you need more detailed information about memory usage in a particular 3003situation, you can enable the MEM_STATS code in jmemmgr.c. 3004 3005 3006Library compile-time options 3007---------------------------- 3008 3009A number of compile-time options are available by modifying jmorecfg.h. 3010 3011The JPEG standard provides for both the baseline 8-bit DCT process and 3012a 12-bit DCT process. The IJG code supports 12-bit lossy JPEG if you define 3013BITS_IN_JSAMPLE as 12 rather than 8. Note that this causes JSAMPLE to be 3014larger than a char, so it affects the surrounding application's image data. 3015The sample applications cjpeg and djpeg can support 12-bit mode only for PPM 3016and GIF file formats; you must disable the other file formats to compile a 301712-bit cjpeg or djpeg. (install.txt has more information about that.) 3018At present, a 12-bit library can handle *only* 12-bit images, not both 3019precisions. 3020 3021Note that a 12-bit library always compresses in Huffman optimization mode, 3022in order to generate valid Huffman tables. This is necessary because our 3023default Huffman tables only cover 8-bit data. If you need to output 12-bit 3024files in one pass, you'll have to supply suitable default Huffman tables. 3025You may also want to supply your own DCT quantization tables; the existing 3026quality-scaling code has been developed for 8-bit use, and probably doesn't 3027generate especially good tables for 12-bit. 3028 3029The maximum number of components (color channels) in the image is determined 3030by MAX_COMPONENTS. The JPEG standard allows up to 255 components, but we 3031expect that few applications will need more than four or so. 3032 3033On machines with unusual data type sizes, you may be able to improve 3034performance or reduce memory space by tweaking the various typedefs in 3035jmorecfg.h. In particular, on some RISC CPUs, access to arrays of "short"s 3036is quite slow; consider trading memory for speed by making JCOEF, INT16, and 3037UINT16 be "int" or "unsigned int". UINT8 is also a candidate to become int. 3038You probably don't want to make JSAMPLE be int unless you have lots of memory 3039to burn. 3040 3041You can reduce the size of the library by compiling out various optional 3042functions. To do this, undefine xxx_SUPPORTED symbols as necessary. 3043 3044You can also save a few K by not having text error messages in the library; 3045the standard error message table occupies about 5Kb. This is particularly 3046reasonable for embedded applications where there's no good way to display 3047a message anyway. To do this, remove the creation of the message table 3048(jpeg_std_message_table[]) from jerror.c, and alter format_message to do 3049something reasonable without it. You could output the numeric value of the 3050message code number, for example. If you do this, you can also save a couple 3051more K by modifying the TRACEMSn() macros in jerror.h to expand to nothing; 3052you don't need trace capability anyway, right? 3053 3054 3055Portability considerations 3056-------------------------- 3057 3058The JPEG library has been written to be extremely portable; the sample 3059applications cjpeg and djpeg are slightly less so. This section summarizes 3060the design goals in this area. (If you encounter any bugs that cause the 3061library to be less portable than is claimed here, we'd appreciate hearing 3062about them.) 3063 3064The code works fine on ANSI C and C++ compilers, using any of the popular 3065system include file setups, and some not-so-popular ones too. 3066 3067The code is not dependent on the exact sizes of the C data types. As 3068distributed, we make the assumptions that 3069 char is at least 8 bits wide 3070 short is at least 16 bits wide 3071 int is at least 16 bits wide 3072 long is at least 32 bits wide 3073(These are the minimum requirements of the ANSI C standard.) Wider types will 3074work fine, although memory may be used inefficiently if char is much larger 3075than 8 bits or short is much bigger than 16 bits. The code should work 3076equally well with 16- or 32-bit ints. 3077 3078In a system where these assumptions are not met, you may be able to make the 3079code work by modifying the typedefs in jmorecfg.h. However, you will probably 3080have difficulty if int is less than 16 bits wide, since references to plain 3081int abound in the code. 3082 3083char can be either signed or unsigned, although the code runs faster if an 3084unsigned char type is available. If char is wider than 8 bits, you will need 3085to redefine JOCTET and/or provide custom data source/destination managers so 3086that JOCTET represents exactly 8 bits of data on external storage. 3087 3088The JPEG library proper does not assume ASCII representation of characters. 3089But some of the image file I/O modules in cjpeg/djpeg do have ASCII 3090dependencies in file-header manipulation; so does cjpeg's select_file_type() 3091routine. 3092 3093The JPEG library does not rely heavily on the C library. In particular, C 3094stdio is used only by the data source/destination modules and the error 3095handler, all of which are application-replaceable. (cjpeg/djpeg are more 3096heavily dependent on stdio.) malloc and free are called only from the memory 3097manager "back end" module, so you can use a different memory allocator by 3098replacing that one file. 3099 3100More info about porting the code may be gleaned by reading jconfig.txt, 3101jmorecfg.h, and jinclude.h. 3102