Jexcelapi Tutorial
Jexcelapi Tutorial
Contents
Introduction
Reading a spreadsheet
Writing a spreadsheet
Fundamentals
Adding format information
Formatting numbers
Formatting dates
Copying and modifying a spreadsheet
Demo & Test programs
Frequently Asked Questions
java.lang.OutOfMemory Exception
Compiling
Uploading a Spreadsheet
Support for charts, macros and images
Date display
Cell formats across multiple workbooks
Cross sheet formulas
Introduction
The information presented in this tutorial is intended as a brief overview as to how JExcelApi may
be used to read and write data in Excel format. The tutorial is by no means exhaustive, and if
something is not described here, it does not mean that it cannot be done. The reader is encouraged to
examine the API documentation and the sample code (particularly Write.java and ReadWrite.java) in
order to gain a more complete understanding of the capabilities and limitations of the software.
Reading Spreadsheets
JExcelApi can read an Excel spreadsheet from a file stored on the local filesystem or from some
input stream. The first step when reading a spreadsheet from a file or input stream is to create a
Workbook. The code fragment below illustrates creating a workbook from a file on the local
filesystem.
import java.io.File;
import java.util.Date;
import jxl.*;
...
(NOTE: when creating a spreadsheet from a ServletInputStream you must remove the HTTP header
information before creating the Workbook object.)
Once you have accessed the workbook, you can use this to access the individual sheets. These are
zero indexed - the first sheet being 0, the second sheet being 1, and so on. (You can also use the API
to retrieve a sheet by name).
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
Once you have a sheet, you can then start accessing the cells. You can retrieve the cell's contents as a
string by using the convenience method getContents(). In the example code below, A1 is a text cell,
B2 is numerical value and C2 is a date. The contents of these cells may be accessed as follows
Cell a1 = sheet.getCell(0,0);
Cell b2 = sheet.getCell(1,1);
Cell c2 = sheet.getCell(2,1);
The demo programs CSV.java and XML.java use the convenience method getContents() in order to
output the spreadsheet contents.
However if it is required to access the cell's contents as the exact type ie. as a numerical value or as a
date, then the retrieved Cell must be cast to the correct type and the appropriate methods called. The
section of code below illustrates how JExcelApi may be used to retrieve a genuine java double and
java.util.Date object from an Excel spreadsheet. For completeness the label is also cast to it's correct
type, although in practice this makes no difference. The example also illustrates how to verify that
cell is of the expected type - this can be useful when validating that the spreadsheet has cells in the
correct place.
Cell a1 = sheet.getCell(0,0);
Cell b2 = sheet.getCell(1,1);
Cell c2 = sheet.getCell(2,1);
if (a1.getType() == CellType.LABEL)
{
LabelCell lc = (LabelCell) a1;
stringa1 = lc.getString();
}
if (b2.getType() == CellType.NUMBER)
{
NumberCell nc = (NumberCell) b2;
numberb2 = nc.getValue();
}
if (c2.getType() == CellType.DATE)
{
DateCell dc = (DateCell) c2;
datec2 = dc.getDate();
}
When you have finished processing all the cells, use the close() method. This frees up any allocated
memory used when reading spreadsheets and is particularly important when reading large
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
spreadsheets.
Writing Spreadsheets
Fundamentals
This section describes how to write out simple spreadsheet data without any formatting information,
such as fonts or decimal places.
Similarly to reading a spreadsheet, the first step is to create a writable workbook using the factory
method on the Workbook class.
import java.io.File;
import java.util.Date;
import jxl.*;
import jxl.write.*;
...
This creates the workbook object. The generated file will be located in the current working directory
and will be called "output.xls". The API can also be used to send the workbook directly to an output
stream eg. from a web server to the user's browser. If the HTTP header is set correctly, then this will
launch Excel and display the generated spreadsheet.
The next stage is to create sheets for the workbook. Again, this is a factory method, which takes the
name of the sheet and the position it will occupy in the workbook. The code fragment below creates
a sheet called "First Sheet" at the first position.
Now all that remains is to add the cells into the worksheet. This is simply a matter of instantiating
cell objects and adding them to the sheet. The following code fragment puts a label in cell A3, and
the number 3.14159 in cell D5.
There are a couple of points to note here. Firstly, the cell's location in the sheet is specified as part of
the constructor information. Once created, it is not possible to change a cell's location, although the
cell's contents may be altered.
The other point to note is that the cell's location is specified as (column, row). Both are zero indexed
integer values - A1 being represented by (0,0), B1 by (1,0), A2 by (0,1) and so on.
Once you have finished adding sheets and cells to the workbook, you call write() on the workbook,
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
and then close the file. This final step generates the output file (output.xls in this case) which may be
read by Excel. If you call close() without calling write() first, a completely empty file will be
generated.
...
// All sheets and cells added. Now write out the workbook
workbook.write();
workbook.close();
The previous section illustrates the fundamentals of generating an Excel compatible spreadsheet
using the JExcelApi. However, as it stands Excel will render the data in the default font, and will
display the numbers to 3 decimal places. In order to supply formatting information to Excel, we must
make use of the overloaded constructor, which takes an additional object containing the cell's
formatting information (both the font and the style).
The code fragment below illustrates creating a label cell for an arial 10 point font.
Cell formats objects are shared, so many cells may use the same format object, eg.
This creates another label, with the same format, in cell C1.
Because cell formats are shared, it is not possible to change the contents of a cell format object. (If
this were permitted, then changing the contents of the object could have unforeseen repurcussions on
the look of the rest of the workbook). In order to change the way a particular cell is displayed, the
API does allow you to assign a new format to an individual cell.
The constructors for the WritableFont object have many overloads. By way of example, the code
fragment below creates a label in 16 point Times, bold italic and assigns it to position D1.
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
sheet.addCell(label4);
Formatting Numbers
Number formatting information may be passed to the cell format object by a similar mechanism to
that described for fonts.
A variety of predefined number formats are defined statically. These may be used to format
numerical values as follows:
The above code inserts the value 3.14159 into cells A5 and B5, using the preset integer and floating
points format respectively. When Excel renders these cells, A5 will display as "3" and B5 will
display as "3.14", even though both cells contain the same floating point value.
It's possible for a user to define their own number formats, by passing in a number format string. The
string passed in should be in the same format as that used by the java.text.DecimalFormat class. To
format a number to display up to five decimal places in cell C5, the following code fragment may be
used:
It is, of course, also possible to specify font information as well eg. to display the same value in the
16 point times bold font defined earlier we can write
Formatting Dates
Dates are handled similarly to numbers, taking in a format compatible with that used by the
java.text.SimpleDateFormat class. In addition, several predefined date formats are specified in the
jxl.write.DateFormat class.
As a brief example, the below code fragment illustrates placing the current date and time in cell A7
using a custom format:
// Get the current date and time from the Calendar object
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
Date now = Calendar.getInstance().getTime();
DateFormat customDateFormat = new DateFormat ("dd MMM yyyy
hh:mm:ss");
WritableCellFormat dateFormat = new WritableCellFormat
(customDateFormat);
DateTime dateCell = new DateTime(0, 6, now, dateFormat);
sheet.addCell(dateCell);
As with numbers, font information may be used to display the date text by using the overloaded
constructors on WritableCellFormat.
For a more extensive example of writing spreadsheets, the demonstration program Write.java should
be studied. In addition to the functionality described above, this program tests out a variety of cell,
formatting and font options, as well as displaying cells with different background and foreground
colours, shading and boundaries.
import java.io.File;
import java.util.Date;
import jxl.*;
import jxl.write.*;
...
This creates a readable spreadsheet. To obtain a writable version of this spreadsheet, a copy must be
made, as follows:
The API functions this way is for reasons of read efficiency (since this is the primary use of the
API). In order to improve performance, data which relates to output information (eg. all the
formatting information such as fonts) is not interpreted when the spreadsheet is read, since this is
superfluous when interrogating the raw data values. However, if we need to modify this spreadsheet
a handle to the various write interfaces is needed, which can be obtained using the copy method
above. This copies the information that has already been read in as well as performing the additional
processing to interpret the fields that are necessary to for writing spreadsheets. The disadvantage of
this read-optimized strategy is that we have two spreadsheets held in memory rather than just one,
thus doubling the memory requirements. For this reason copying and modifying large spreadsheets
can be expensive in terms of processing and memory.
Once we have a writable interface to the workbook, we may retrieve and modify cells. The following
code fragment illustrates how to modify the contents of a label cell located in cell B3 in sheet 2 of
the workbook.
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
WritableCell cell = sheet2.getWritableCell(1, 2);
if (cell.getType() == CellType.LABEL)
{
Label l = (Label) cell;
l.setString("modified cell");
}
There is no need to call the add() method on the sheet, since the cell is already present on the sheet.
The contents of numerical and date cells may be modified in a similar way, by using the setValue()
and setDate() methods respectively.
Although cell formats are immutable, the contents of a cell may be displayed differently by assigning
a different format object to the cell. The following code fragment illustrates changing the format of
numerical cell (in position C5) so that the contents will be displayed to five decimal places.
cell.setFormat(cellFormat);
Since the copy of the workbook is an ordinary writable workbook, new cells may be added to the
sheet, thus:
As before, once the modifications are complete, the workbook must be written out and closed.
...
// All cells modified/added. Now write out the workbook
copy.write();
copy.close();
The demo program contained in the source file ReadWrite.java may be studied as a more exhaustive
example of how spreadsheets may be modified. This demo program copies the spreadsheet passed in
on the command line; if the spreadsheet to be copied is the example spreadsheet, jxlrwtest.xls,
located in the current directory, then certain modifications are carried out. DO NOT MODIFY THE
EXAMPLE SPREADSHEET, otherwise the demo program will not work.
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
Reads spreadsheet.xls and writes out the corresponding csv data to the standard output. The -csv
option is the default and may omitted
Reads spreadsheet.xls and writes out the corresponding cell data to the standard output as XML.
As above, but includes formatting information (font, number formats etc) in the generated XML
Reads spreadsheet.xls and displays all the formulas contained in that sheet.
Reads in.xls, copies it and generates a file called out.xls. If the spreadsheet passed in is a special
sample spreadsheet called jxlrwtest.xls, then this demo will modify specific cells in the copy, out.xls.
java.lang.OutOfMemory Exception
By default a JVM places an upper limit on the amount of memory available to the current process in
order to prevent runaway processes gobbling system resources and making the machine grind to a
halt. When reading or writing large spreadsheets, the JVM may require more memory than has been
allocated to the JVM by default - this normally manifests itself as a java.lang.OutOfMemory
exception.
For command line processes, you can allocate more memory to the JVM using the -Xms and -Xmx
options eg. to allocate an initial heap allocation of 10 mB, with 100 mB as the upper bound you can
use
In order to allocate more memory in this manner to servlets/JSPs, consult the help documentation for
the Web Application Server.
Compiling
The distribution of JExcelApi comes with a build.xml file. This may be used by the build tool, ant, in
order to build the software. If ant is not already installed on your machine, it may obtained here.
To build API using ant simply change to the subdirectory called build, from the command line, type
ant
This will detect any source files which have recent changes, compile them and create the jar file in
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
the parent directory.
The build.xml specifies a number of targets. To totally rebuild the whole application, including the
javadoc documentation, then obtain a command line prompt within the build directory and type
ant jxlall
As an alternative to using ant, JExcelApi may be built using the standard java tools. From the
command line in the build subdirectory issue the following sequence of commands (modifying file
separators and classpath separators as required for the target operating system):
Below is some indicative code which may be used for uploading spreadsheets from a client browser
to servlet.
In the HTML page which is displayed to the user requesting the upload, declare a form of multipart
form data:
The servlet which processes this code should access the input stream directly. Because of the
encoding method, it is not possible to use the request.getParameter() methods.
Accessing the input stream directly means that the HTTP information is present. The first thing to do
is strip off this redundant information before passing the input stream directly to the API, thus:
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
}
catch (JXLException e)
{
...
}
}
JExcelApi has limited support for charts: when copying a spreadsheet containing a chart, the chart is
written out to the generated spreadsheet (as long as the sheet containing the chart contains other data
as well as the chart).
All macro and information is ignored. Consequently when copying and writing out the macros that
were in the original will not be present in the generated version.
All image information is preserved when copying spreadsheets. When adding an image to a
spreadsheet only images in PNG format are supported
Date display
When displaying dates, the java.util package automatically adjusts for the local timezone. This can
cause problems when displaying dates within an application, as the dates look as if they are exactly
one day previous to that which is stored in the Excel spreadsheet, although this is not in fact the case.
Excel stores dates as a numerical value, and the conversion process for transforming this into a
java.util.Date consists of converting the Excel number into a UTC value and then using the UTC
number to create the java Date. Say the number in Excel represents 20 August 2003, then the UTC
equivalent of this number is used to create a java.util.Date object.
The problem occurs if you are operating in a timezone other than GMT. As soon as you try and
perform any user IO on that java Date object (eg. System.out.print(date)) the JVM will perform
timezone adjustment calculations. If you are located in EST zone (which is GMT - 5 hours) java will
subtract 5 hours from the date - so the Date object instead of being 00:00 20/08/2003 becomes 19:00
19/08/2003. Because java recognizes you only want a date and not a date time, it truncates the
hours/minutes/seconds and presents 19/08/2003 - so it appears that the day is one day less than was
stored in Excel, whereas it is really only a few hours (the timezone offset) less. Needless to say, this
is a very annoying feature.
The easiest way to work around this (and the method used internally by the getContents() method of
a jxl.DateCell) is to force the timezone of the date format as follows:
Sometimes a single process may generate multiple workbooks. When doing this is it tempting to
create the various cell formats once (eg. as member data or as static constants) and apply them to
cells in both workbooks. This works fine for the first workbook, but for subsequent workbooks this
can cause unexpected cell formatting. The reason for this is that when a format is first added to a
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30
workbook, JExcelApi assigns an internal cross-reference number to that cell, and all other cells
which share this format simply store the cross reference number. However, when you add the same
cell format to a different workbook, JExcelApi recognizes that the format has been added to a
workbook, and simply refers to the format by the index number, rather than by initializing it
properly. When Excel then tries to read this workbook, it sees an index number, but is unable to read
the cell format (or reads a different one) as the cell can be formatted in an unpredictable manner.
The long and the short of it is that if it is necessary to re-use formats across multiple workbooks, then
the WritableCellFormat objects must be re-created and initialised along with the each Workbook
instance, and NOT re-used from a previous workbook.
JExcelApi supports formulas across sheets. However, please make sure all the sheets have been
added to the workbook (even if they are blank) before attempting to create cross sheet formulas. This
is because if you create a cross sheet formula referencing a sheet and then subsequently and or
remove sheets from the workbook, the sheet reference in the formula when it was parsed won't
necessarily reference the correct sheet, and could even cause Excel to crash
http://www.andykhan.com/jexcelapi/tutorial.html 2008-05-30