Class CSVParser
- All Implemented Interfaces:
- Closeable,- AutoCloseable,- Iterable<CSVRecord>
CSVFormat.
 The parser works record-wise. It is not possible to go back, once a record has been parsed from the input stream.
 Creating instances
There are several static factory methods that can be used to create instances for various types of resources:
- parse(java.io.File, Charset, CSVFormat)
- parse(String, CSVFormat)
- parse(java.net.URL, java.nio.charset.Charset, CSVFormat)
 Alternatively parsers can also be created by passing a Reader directly to the sole constructor.
 For those who like fluent APIs, parsers can be created using CSVFormat.parse(java.io.Reader) as a shortcut:
 
 for (CSVRecord record : CSVFormat.EXCEL.parse(in)) {
     ...
 }
 
 Parsing record wise
To parse a CSV input from a file, you write:
 File csvData = new File("/path/to/csv");
 CSVParser parser = CSVParser.parse(csvData, CSVFormat.RFC4180);
 for (CSVRecord csvRecord : parser) {
     ...
 }
 This will read the parse the contents of the file using the RFC 4180 format.
To parse CSV input in a format like Excel, you write:
 CSVParser parser = CSVParser.parse(csvData, CSVFormat.EXCEL);
 for (CSVRecord csvRecord : parser) {
     ...
 }
 
 
 If the predefined formats don't match the format at hand, custom formats can be defined. More information about
 customizing CSVFormats is available in CSVFormat Javadoc.
 
Parsing into memory
If parsing record-wise is not desired, the contents of the input can be read completely into memory.
 Reader in = new StringReader("a;b\nc;d");
 CSVParser parser = new CSVParser(in, CSVFormat.EXCEL);
 List<CSVRecord> list = parser.getRecords();
 There are two constraints that have to be kept in mind:
- Parsing into memory starts at the current position of the parser. If you have already parsed records from the input, those records will not end up in the in-memory representation of your CSV data.
- Parsing into memory may consume a lot of system resources depending on the input. For example, if you're parsing a 150MB file of CSV data the contents will be read completely into memory.
Notes
The internal parser state is completely covered by the format and the reader state.
- See Also:
- 
Nested Class SummaryNested Classes
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionstatic CSVParser.Builderbuilder()Creates a new builder.voidclose()Closes resources.longGets the current line number in the input stream.Gets the first end-of-line string encountered.Gets the header comment, if any.Gets a copy of the header map as defined in the CSVFormat's header.Gets a read-only list of header names that iterates in column order as defined in the CSVFormat's header.longGets the current record number in the input stream.Parses the CSV input according to the given format and returns the content as a list ofCSVRecords.Gets the trailer comment, if any.booleanChecks whether there is a header comment.booleanChecks whether there is a trailer comment.booleanisClosed()Tests whether this parser is closed.iterator()Returns the record iterator.static CSVParserCreates a parser for the givenFile.static CSVParserparse(InputStream inputStream, Charset charset, CSVFormat format) Creates a CSV parser using the givenCSVFormat.static CSVParserCreates a CSV parser using the givenCSVFormatstatic CSVParserCreates a parser for the givenString.static CSVParserCreates and returns a parser for the given URL, which the caller MUST close.static CSVParserCreates and returns a parser for the givenPath, which the caller MUST close.stream()Returns a sequentialStreamwith this collection as its source.Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface java.lang.IterableforEach, spliterator
- 
Constructor Details- 
CSVParserDeprecated.Will be removed in the next major version, useCSVParser.Builder.get().Constructs a new instance using the givenCSVFormat.If you do not read all records from the given reader, you should callclose()on the parser, unless you close thereader.- Parameters:
- reader- a Reader containing CSV-formatted input. Must not be null.
- format- the CSVFormat used for CSV parsing. Must not be null.
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent or if either reader or format are null.
- IOException- If there is a problem reading the header or skipping the first record
- CSVException- Thrown on invalid CSV input data.
 
- 
CSVParser@Deprecated public CSVParser(Reader reader, CSVFormat format, long characterOffset, long recordNumber) throws IOException Deprecated.Will be removed in the next major version, useCSVParser.Builder.get().Constructs a new instance using the givenCSVFormat.If you do not read all records from the given reader, you should callclose()on the parser, unless you close thereader.- Parameters:
- reader- a Reader containing CSV-formatted input. Must not be null.
- format- the CSVFormat used for CSV parsing. Must not be null.
- characterOffset- Lexer offset when the parser does not start parsing at the beginning of the source.
- recordNumber- The next record number to assign.
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent or if either the reader or format is null.
- IOException- if there is a problem reading the header or skipping the first record
- CSVException- on invalid input.
- Since:
- 1.1
 
 
- 
- 
Method Details- 
builderCreates a new builder.- Returns:
- a new builder.
- Since:
- 1.13.0
 
- 
parseCreates a parser for the givenFile.- Parameters:
- file- a CSV file. Must not be null.
- charset- The Charset to decode the given file,- nullmaps to the- default Charset.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new parser
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent.
- IOException- If an I/O error occurs
- CSVException- Thrown on invalid CSV input data.
- NullPointerException- if- fileis- null.
 
- 
parsepublic static CSVParser parse(InputStream inputStream, Charset charset, CSVFormat format) throws IOException Creates a CSV parser using the givenCSVFormat.If you do not read all records from the given reader, you should callclose()on the parser, unless you close thereader.- Parameters:
- inputStream- an InputStream containing CSV-formatted input,- nullmaps to- CSVFormat.DEFAULT.
- charset- The Charset to decode the given file,- nullmaps to the- default Charset.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new CSVParser configured with the given reader and format.
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent or if either reader or format are null.
- IOException- If there is a problem reading the header or skipping the first record
- CSVException- Thrown on invalid CSV input data.
- Since:
- 1.5
 
- 
parseCreates and returns a parser for the givenPath, which the caller MUST close.- Parameters:
- path- a CSV file. Must not be null.
- charset- The Charset to decode the given file,- nullmaps to the- default Charset.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new parser
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent.
- IOException- If an I/O error occurs
- CSVException- Thrown on invalid CSV input data.
- NullPointerException- if- pathis- null.
- Since:
- 1.5
 
- 
parseCreates a CSV parser using the givenCSVFormatIf you do not read all records from the given reader, you should callclose()on the parser, unless you close thereader.- Parameters:
- reader- a Reader containing CSV-formatted input. Must not be null.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new CSVParser configured with the given reader and format.
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent or if either reader or format are null.
- IOException- If there is a problem reading the header or skipping the first record
- CSVException- Thrown on invalid CSV input data.
- Since:
- 1.5
 
- 
parseCreates a parser for the givenString.- Parameters:
- string- a CSV string. Must not be null.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new parser
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent.
- IOException- If an I/O error occurs
- CSVException- Thrown on invalid CSV input data.
- NullPointerException- if- stringis- null.
 
- 
parseCreates and returns a parser for the given URL, which the caller MUST close.If you do not read all records from the given url, you should callclose()on the parser, unless you close theurl.- Parameters:
- url- a URL. Must not be null.
- charset- the charset for the resource,- nullmaps to the- default Charset.
- format- the CSVFormat used for CSV parsing,- nullmaps to- CSVFormat.DEFAULT.
- Returns:
- a new parser
- Throws:
- IllegalArgumentException- If the parameters of the format are inconsistent.
- IOException- If an I/O error occurs
- CSVException- Thrown on invalid CSV input data.
- NullPointerException- if- urlis- null.
 
- 
closeCloses resources.- Specified by:
- closein interface- AutoCloseable
- Specified by:
- closein interface- Closeable
- Throws:
- IOException- If an I/O error occurs
 
- 
getCurrentLineNumberGets the current line number in the input stream.Note: If your CSV input has multi-line values, the returned number does not correspond to the record number. - Returns:
- current line number.
 
- 
getFirstEndOfLineGets the first end-of-line string encountered.- Returns:
- the first end-of-line string.
- Since:
- 1.5
 
- 
getHeaderCommentGets the header comment, if any. The header comment appears before the header record.- Returns:
- the header comment for this stream, or null if no comment is available.
- Since:
- 1.10.0
 
- 
getHeaderMapGets a copy of the header map as defined in the CSVFormat's header.The map keys are column names. The map values are 0-based indices. Note: The map can only provide a one-to-one mapping when the format did not contain null or duplicate column names. - Returns:
- a copy of the header map.
 
- 
getHeaderNamesGets a read-only list of header names that iterates in column order as defined in the CSVFormat's header.Note: The list provides strings that can be used as keys in the header map. The list will not contain null column names if they were present in the input format. - Returns:
- read-only list of header names that iterates in column order.
- Since:
- 1.7
- See Also:
 
- 
getRecordNumberGets the current record number in the input stream.Note: If your CSV input has multi-line values, the returned number does not correspond to the line number. - Returns:
- current record number
 
- 
getRecordsParses the CSV input according to the given format and returns the content as a list ofCSVRecords.The returned content starts at the current parse-position in the stream. You can use CSVFormat.Builder.setMaxRows(long)to limit how many rows this method produces.- Returns:
- list of CSVRecords, may be empty
- Throws:
- UncheckedIOException- on parse error or input read-failure
 
- 
getTrailerCommentGets the trailer comment, if any. Trailer comments are located between the last record and EOF- Returns:
- the trailer comment for this stream, or null if no comment is available.
- Since:
- 1.10.0
 
- 
hasHeaderCommentChecks whether there is a header comment. The header comment appears before the header record. Note that if the parser's format has been given an explicit header (withCSVFormat.Builder.setHeader(String...)or another overload) and the header record is not being skipped (CSVFormat.Builder.setSkipHeaderRecord(boolean)is false) then any initial comments will be associated with the first record, not the header.- Returns:
- true if this parser has seen a header comment, false otherwise
- Since:
- 1.10.0
 
- 
hasTrailerCommentChecks whether there is a trailer comment. Trailer comments are located between the last record and EOF. The trailer comments will only be available after the parser has finished processing this stream.- Returns:
- true if this parser has seen a trailer comment, false otherwise
- Since:
- 1.10.0
 
- 
isClosedTests whether this parser is closed.- Returns:
- whether this parser is closed.
 
- 
iteratorReturns the record iterator.An IOExceptioncaught during the iteration is re-thrown as anIllegalStateException.If the parser is closed, the iterator will not yield any more records. A call to Iterator.hasNext()will returnfalseand a call toIterator.next()will throw aNoSuchElementException.If it is necessary to construct an iterator which is usable after the parser is closed, one option is to extract all records as a list with getRecords(), and return an iterator to that list.You can use CSVFormat.Builder.setMaxRows(long)to limit how many rows an Iterator produces.
- 
streamReturns a sequentialStreamwith this collection as its source.If the parser is closed, the stream will not produce any more values. See the comments in iterator().You can use CSVFormat.Builder.setMaxRows(long)to limit how many rows a Stream produces.- Returns:
- a sequential Streamwith this collection as its source.
- Since:
- 1.9.0
 
 
- 
CSVParser.Builder.get().