Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
I want to develop an eBook reader app. What are some good libraries available to parse formats like .azw, .mobi, .pdf etc.?
As Ranhiru said, here and here you can see how PDFs are parsed.
For .mobi, however, there is no library, so you'll have to parse the format yourself. A full specification of the format can be read on the mobileread wiki.
With .azw files, it's different: if the Kindle ebook is DRM-free, then its format coincides with the .mobi one, i.e. they are absolutely interchangeable. Otherwise, it's very difficult to do, since you'll also have to generate a Kindle PID and perform the de-DRM-ing of the .azw file. There's a guide on how to do that on the desktop here. However, it is strongly not recommended, since it breaks the whole point of DRM and is illegal pretty much everywhere.
For mobi there isn't complete spec sheet available, but you should directly jump into PDB format which is extended & used by MOBI
http://jola.comm.pl/palm/opispdb.htm
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
Does anyone know a good library/jar to parse RSS/Atom Feeds?
I'd like to stuff a URL in, and want to get the news-items in a homogenous way. (Means: it should not matter whether the Source contains an Atom or RSS feed. I just want "Items" back.)
After browsing on SO, I came up with ROME, but it seems to be chaotic at the moment (no download for example), and is a pure Java solution. Scala would be preferred, but Java is quite okay if nothing Scala-specific exists. Also, it should be a single JAR Library, since I don't use Maven etc.
PS: It's not for Android, just for a good old desktop app.
EDIT: To be more clear, I already know how to get the content from resources as XML. I want to parse them! Autodetect whether it's Atom or some RSS and give me back a uniform list of items.
Have you checked out Dispatch? It doesn't have any RSS/Atom features built in per se, but you should be able to handle the response text as a scala.xml.NodeSeq and do whatever you need.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I'm developing application which needs to get details from AutoCAD DXF file. Is there any specific API which support this task. Actually in my project I need to get building structures to my java application and I need to add some more details to those graphs So can some one provide your experience about this ?
A quick Google search on java dxf yields the ycad library. I have never used it, I don't know how well it works.
http://caff.de/dxfviewer/ supports reading DXF files.
The reading of an AutoCAD DXF file is not a trivial task. The contained entities can be affected by the coordinate systems, the blocks can be hierarchycal, etc. When I wrote my DXF importer, I read the file sequentialy, built a DXF DOM and then converted it into my own format. You can find the document specification here:
http://images.autodesk.com/adsk/files/autocad_2012_pdf_dxf-reference_enu.pdf
http://www.autodesk.com/techpubs/autocad/acadr14/dxf/dxf_reference.htm
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am looking for a good library or some project that has been done in the area of SMS text normalization. I have found some good research projects like this one.
I am using Java as the programming language.
The concept in a nutshell is to handle SMS based text like "tel him 2 go home nw" and convert it to normal english language text "tell him to go home now".
Why not just to download a dictionary from a site like this: http://smsdictionary.co.uk/abbreviations and use a string replacement?
Dictionary substitution does not cut it, since it misses context in translations. e.g. do you translate '2' to 'to', 'too' or 'two'?
You can get a corpus and train a statistical model yourself, either using Moses (http://www.statmt.org/moses/) or Phrasal (http://nlp.stanford.edu/software/phrasal/).
As an author of the Stanford one (http://www-nlp.stanford.edu/sms/translate.php), I could be convinced to offer a REST based API for such a service, but I don't know the demand for it...
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Can Someone suggest me a good open source java library to read ECG data in MFER / HL7 / Other formats?
There are a number of options for a java library that parses HL7. For example, you could use the HAPI library, available on sourceforge at http://hl7api.sourceforge.net/. There is also a .NET version of that library at http://nhapi.sourceforge.net/home.php, for those that prefer that platform. Another Java-based option is HL7Comm at http://nule.org/wp/?page_id=63.
More options can be found on the Wikipedia page at http://en.wikipedia.org/wiki/Health_Level_7#Open_source_tools, and at http://www.hl7.org.au/HL7-Tools.htm.
For ECG processing in general, see the OpenECG portal at http://www.openecg.net/.
One of the standard ECG software analysis libraries is WFDB and PhysioNet:
http://www.physionet.org/physiotools/wag/wag.htm
Two American National Standards, ANSI/AAMI EC38:1998 (Ambulatory Electrocardiographs) and ANSI/AAMI EC57:1998 (Testing and Reporting Performance Results of Cardiac Rhythm and ST Segment Measurement Algorithms) require the use of several of the WFDB applications for evaluation of certain devices and algorithms.
There are wrappers written, so you can use Java to access the code:
http://www.physionet.org/physiotools/wfdb-swig.shtml
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I have extracted a number of OleObject files from a Word DOCX document (oleObject1.bin ... etc.) Is there Open Source code (primarily in Java) to read this. (I would only be interested in *.net code if it can be easily ported to Java). I am not interested in the rest of the Word document (in this question). There is an Apache project (http://poi.apache.org/) - I'd be interested in how well this works
Apache POI - POIFS is a Java implementation of the OLE 2 Compound Document format. In my experience it works pretty well for the sort of scenario you are reffering to. Check it out.