Use POI to parse Excel but got exception "Invalid Header Signature"

I was trying to use Apache POI (version 3.6) to parse an Excel .xls file, but I only got this exception:
java.io.IOException: Invalid header signature; read 0x07B1FD124BEDF108, expected 0xE11AB1A1E011CFD0
I Googled it, and the results basically say "the file is not actually a valid Excel file (e.g. it is a .csv) but has been given the .xls suffix". But I'm quite sure that my Excel file is valid (in Excel 97-2003 format).
For confidentiality reasons I can't post the file, but when I view the binary file with emacs hexl-mode, the header is:
D0CF 11E0 A1B1 1AE1
I think that is exactly what POI expects (E11AB1A1E011CFD0 with the byte order reversed). So why do I get the exception?
BTW, if I view the same file in vim with the command %!xxd, I get a header different from the one emacs shows:
C390 C38F 11C3 A0C2
The whole binary dump looks totally different, and I don't understand why.
Thanks for any help!

If you get that exception, then your file really isn't a true .xls file. It will instead be either some other kind of file renamed to have a .xls extension, or a corrupted file.
I'd suggest you try opening the file in Excel and doing a Save As. That may give you a hint as to the file type. If not, save it as Excel .xls, and then you'll be able to open that file.
I don't know what your file is (I don't recognise the header), but I can assure you that it doesn't have the OLE2 header that a valid .xls file would have.
It's possible that Apache Tika may be able to work out what kind of binary file it is, so you could always try the Tika-App jar.
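To check the header yourself without POI, here is a minimal sketch that reads the first eight bytes of a file and compares them with the OLE2 signature. The class and method names (Ole2HeaderCheck, readHeader, isOle2) are made up for illustration, and it assumes Java 9 or later for readNBytes.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of POI.
class Ole2HeaderCheck {
    // OLE2 signature in file order; POI reports it as the
    // little-endian long 0xE11AB1A1E011CFD0.
    static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Reads up to the first 8 bytes of the file.
    static byte[] readHeader(Path file) throws IOException {
        byte[] buf = new byte[8];
        try (InputStream in = Files.newInputStream(file)) {
            int n = in.readNBytes(buf, 0, buf.length); // Java 9+
            return Arrays.copyOf(buf, n);
        }
    }

    // True only if the header is exactly the 8-byte OLE2 signature.
    static boolean isOle2(byte[] header) {
        return Arrays.equals(header, OLE2_MAGIC);
    }

    // Formats bytes as upper-case hex, e.g. D0CF11E0A1B11AE1.
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: Ole2HeaderCheck <file>");
            return;
        }
        byte[] header = readHeader(Paths.get(args[0]));
        System.out.println(hex(header)
            + (isOle2(header) ? " looks like OLE2" : " is not an OLE2 header"));
    }
}
```

Run it against the exact file your program opens (for example the copy in your build output directory), since that may differ from the one you inspected in an editor.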

Just an idea: if you are using Maven, make sure that filtering is set to false in the resource tag in your pom.xml. Otherwise Maven tends to corrupt .xls files during the copying phase.
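To illustrate how a text-oriented copy step can damage a binary file, here is a small sketch that decodes the OLE2 header as ISO-8859-1 text and writes it back as UTF-8. The choice of those two encodings is an assumption made for the demonstration. What Maven actually does depends on the project's encoding settings, and the class name TextFilterCorruption is made up.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; the encodings are assumed, not taken from Maven.
class TextFilterCorruption {
    // Treats the bytes as ISO-8859-1 text and re-encodes them as UTF-8,
    // the way a tool that handles the file as text might.
    static byte[] reencode(byte[] original) {
        String asText = new String(original, StandardCharsets.ISO_8859_1);
        return asText.getBytes(StandardCharsets.UTF_8);
    }

    // Formats bytes as upper-case hex.
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] ole2 = {
            (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
            (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
        };
        System.out.println("before: " + hex(ole2));
        // prints "after:  C390C38F11C3A0C2A1C2B11AC3A1"
        System.out.println("after:  " + hex(reencode(ole2)));
    }
}
```

Every byte above 0x7F becomes two bytes, so the header no longer matches and the file grows. The first eight bytes of the output are the same as the xxd dump quoted in the question (C390 C38F 11C3 A0C2), which suggests that dump went through a similar conversion. It does not explain the different signature in the exception message.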

Related

Why does unpacking the contents of the same zipped file differ in their byte arrays?

We have zip archives on our server that contain .csv files. If requested, we want to deliver those files as .xlsx within a newly created archive. I have already written the code to do the conversion using apache-poi and am having trouble asserting the results during testing.
In my test, I have one input zip file that contains a .csv. I run the test once and store the result of the conversion as a zip file (to use it as the expected output afterwards). I understand that comparing the resulting zip archives directly is not possible, so I unpack them, read all bytes of each file in both folders and compare them with Assertions.assertArrayEquals. This is where I run into trouble. Here and there, the arrays are off in both length and content.
Can anyone tell me why that is? When I look at the actual files, they are identical and look the way they should after conversion.

Convert faulty Webpage/Excel to proper Excel

I have an app that automatically processes a range of Excel files, but I have one issue. Some of the files seem to be HTML files with a .xls extension (opening them in Excel gives a corruption warning, and re-saving shows that Excel wants to save them as HTML).
When using Apache POI:
try (Workbook wkbk = WorkbookFactory.create(myCorruptFile)) {
//myCorruptFile is of type File
This fails with the Apache POI NotOLE2FileException below:
Invalid header signature; read 0x0A0D3E6C6D74683C, expected 0xE11AB1A1E011CFD0 - Your file appears not to be a valid OLE2 document, { }
If I manually re-save the file as .xls, it is processed correctly, but is there a way to detect and re-save/convert this file using Java 11? I need an automated solution; converting the files manually isn't an option for me.
myCorruptFile.getContentType()
Gives content type as:
application/vnd.ms-excel
And using Apache Tika gives detected type as:
tika.detect(myCorruptFile.getBytes())
text/html
(My maven pom has no filtering)
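The signature in the exception message can be decoded by hand. POI reports the first eight bytes of the file as a little-endian long, so reading the value from its lowest byte upward gives the bytes in file order. A minimal sketch (the class and method names are made up for illustration):

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for reading POI's "Invalid header signature" value.
class SignatureDecoder {
    // Converts the little-endian long from the message back to file order.
    static byte[] toFileOrder(long signature) {
        byte[] out = new byte[8];
        for (int i = 0; i < 8; i++) {
            out[i] = (byte) (signature >>> (8 * i)); // lowest byte first
        }
        return out;
    }

    // Interprets the eight bytes as ASCII text.
    static String asAscii(long signature) {
        return new String(toFileOrder(signature), StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        // Value taken from the exception message above.
        String start = asAscii(0x0A0D3E6C6D74683CL);
        // prints "<html>" (the remaining two bytes are CR and LF)
        System.out.println(start.trim());
    }
}
```

So the file literally begins with the text `<html>` followed by a CRLF line ending, which agrees with the text/html type that Tika detects.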

How to append text into an existing file using java 1.4

I want to append text to an existing text file. I have tried plenty of ways (FileWriter, BufferedWriter, PrintWriter, RandomAccessFile, OutputStream, FileOutputStream, PrintStream), but I can't get the output I want.
I keep getting this error: java.io.FileNotFoundException: could not open file '//file:/usr/backupdata/5605.txt' using mode 'a+'. (I am working with eWON Flexy hardware, which supports only javaetk 1.4.)

You are trying to use a URL as a file name. That won't work.
Use
/usr/backupdata/5605.txt
as the file name.
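As a rough sketch of appending with a plain path, the two-argument FileWriter constructor opens the file in append mode. It is written without try-with-resources so that it stays within Java 1.4 syntax. Whether the eWON toolkit exposes java.io.FileWriter is an assumption I have not verified, and the class name AppendExample is made up.

```java
import java.io.FileWriter;
import java.io.IOException;

// Hypothetical example class; uses only Java 1.4-era syntax and APIs.
class AppendExample {
    // Appends one line to the file at the given plain path (not a URL).
    static void appendLine(String path, String line) throws IOException {
        FileWriter writer = new FileWriter(path, true); // true = append
        try {
            writer.write(line);
            writer.write(System.getProperty("line.separator"));
        } finally {
            writer.close();
        }
    }

    public static void main(String[] args) throws IOException {
        // Path from the answer above; adjust it for your device.
        appendLine("/usr/backupdata/5605.txt", "new entry");
    }
}
```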

Apache POI org.apache.poi.ss.formula.FormulaParseException for shiftrows function call.

I am using Apache POI to read and write Excel files for both xls and xlsx formats.
If the code runs the following line on a file written by POI (my code), it doesn't throw an exception, but for a file written by a user from Excel, I get:
org.apache.poi.ss.formula.FormulaParseException: Specified named range 'LOCAL_YEAR_FORMAT' does not exist in the current workbook.
The Exception is fired at:
postWB.getSheet(postSheet.getSheetName()).shiftRows(i, postSheet.getPhysicalNumberOfRows(), 1);
Judging from some of the questions here, this can be caused by jar incompatibilities or by a bug. I have replaced all jars with the latest POI library and its dependencies.
Is there any workaround to call shiftRows without hitting this exception?

R - Error: IllegalArgumentException (Java): Your InputStream was neither an OLE2 stream, nor an OOXML stream

I use the R XLConnect package.
When I call XLConnect functions such as loadWorkbook() or readWorksheetFromFile(), this error message appears:
Error: IllegalArgumentException (Java): Your InputStream was neither
an OLE2 stream, nor an OOXML stream
How to solve this problem?
Before using these functions, I followed the steps at http://www.r-bloggers.com/getting-r-and-java-1-8-to-work-together-on-osx/ to stop R from crashing on Mac OS X.
I am on Mac OS X.

This message states that the file you have provided to loadWorkbook has not been recognized as a *.xls (BIFF-8) or *.xlsx (OOXML) file.

I am having the same issue following a Java update.
I was trying to load a .xlsx file with the loadWorkbook() function of the R XLConnect package.
I temporarily solved the issue by loading an .xls file instead.

I also use OS X, and after this function had worked without problems for a while, the error appeared for no apparent reason. But the reason is really simple. Excel (actually, the whole MS Office suite) creates temporary files while you have a file open. This file is hidden:
In my case, I list the .xlsx files and open them inside a loop. The first file in the list was one of these hidden files, which raised the error. Closing Excel (which deletes those files) avoids the error.
