As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 10 years ago.
Hi I am looking for a open-source java API that can convert tiff image to searchable pdf (OCR). I have research around but found nothing so far.
NOTE
I have looked at this post but this API does not convert the image to pdf
Java OCR implementation. However, I am still playing with the code a bit.
You can convert images to PDF using iText. The hard thing here is doing the OCR, not creating the PDF.
I will warn you: any OCR engine that is worth using is going to cost you a significant amount of money. Free and/or open source ones are generally pet projects, proof of concept for some algorithm or another. Not suitable for real world OCR applications. Tesseract is probably the best of the bunch, but even that has accuracies that are far, far worse than commercial engines.
We have a commercial OCR application, and I've been down this path while evaluating engines - I'd suggest that you bite the bullet and reach out to the engine providers and get quotes: Abbyy (best accuracy, most expensive, slower), Expervision (fast, not as accurate, middle of the road price), Nuance (middle of the road speed, accuracy and price). None of these will be written in Java, so you should plan some time to develop JNI code around their APIs.
Good luck - it's a big project!
Cuneiform is free and easy to use, it will output in hocr format, which can then be used to generate an invisible text layer on a PDF using hocr2pdf tool, which is part of ExactImage.
Related
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 9 years ago.
I'm planning on building a network throughput speed test (speed test) that will eventually be embedded in a website for random people to test their internet connection speed. The test will not only test their speed, but also trigger a test on the server to test the server uplink speed at the same time (to display as a comparison at that moment in time).
I'm very comfortable in Java - but an applet seems to be a poor choice in 2013.
The client is specifying they would like it in either Java (and applet) or Flash (oh god... ). I have no Flash experience, but once you know to program, you know to program... so it would just require learning the Flash toolchain/processes and syntax (hoping this isn't too crazy).
However, I feel Flash is a poor choice as well (it's already being killed off by Adobe and it seems futile to invest so much effort to learn a platform that will go away (relatively) sometime in the near future).
As the domain expert, I am free to make recommendations based on what I feel is not only viable, but makes sense. Javascript comes to mind here, but I have little Javascript experience too (so in same boat of having to learn it). I think Javascript's barrier to entry is much lower though since it's all embedded in HTML pages and I know HTML fairly well.
Anyways, please make recommendations for such a project (platform-wise) and if possible point me in some direction of source code or examples of other speed tests that I can review and learn from (i tried searching but all i get back is functioning speed tests like speedtest.com etc, not source or theory, or design patters behind them).
Test network speed using PHP/Javascript
They didn't find a answer but there are some good suggestions. Just read it :)
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 9 years ago.
I intend to create a webapp in which members of an organization will sign in, filter and search for events, and sign up to attend events. I'm just making this for one organization, but I'll most likely make it open source when I'm done.
I have my own hosting space on GoDaddy, but then I found Google's App Engine and figured it would be more efficient.
I also plan on using Twitter Bootstrap and HTML5 Boilerplate (maybe?). UI is important to me. I'll also make an iPad app with the same function that uses the same data sometime soon.
So Python or Java? What are the advantages and disadvantages for this application for each? I've learned the absolute basics of both languages but have not used either of them for the web.
I'd consider the Go Programming Language, but it doesn't look as community-supported.
TIA!
#Blender and #Barranka are right. Both languages are equally well supported on App Engine, with only very minor differences. The web client libraries you mention should integrate equally well with either, and you won't easily be able to port code to iOS in either language. (If you also plan an Android app, that might be a vote for Java, but you didn't mention that.)
So the answer is, pick the language you prefer. Both will work.
(If you don't have a preference, maybe lean toward Python, since it's higher level and supports rapid development a bit better, and Java's advantages (performance, static typing, etc.) aren't as important for what you want to build.)
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 11 years ago.
I am a iOS programmer so confident with objective c. I have also naturally programmed in Java. I am branching out into android programming and was wondering about the different programming approaches.
I have read in many different articles that there are significant differences between programming in a Java style and a Objective c/ c++ style. Whilst I know the languages are different. I was wondering if there are any Java/Android/iOS programmers that could give any pointers for someone branching out in android?
Cheers
Yes, definetly, both architecture iPhone and android are very different, but there are many similarities also, you should start with simiilarites, which would increase your learning speed, later as need arise, you can learn differences also.
Like iOS application have AppDelegate as a single class, existed lifetime of the application. In android, though not mandatory but you can have an application class, which can work similarly as AppDelegate. In iOS screens are designed in nib files, android also has a gui tool to design screens. But its not simply drag-drop and you also supposed to understand screen size variety in android.
Android does not follow, MVC architecture fully, and it makes a big difference, but if you still want to follow MVC framework, there is an open source api, on google code, by whivh you can implement mvvc architecture in android applications.
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 11 years ago.
I am very new to programming. I am familiar with HTML, C++ and learning PHP to start a database.
I want to make a website which tracks a stock price. I have written various algorithms in Matlab however, MATLAB only has a to-Java conversion.
I was wondering what language would be the best to do a lot of calculations. I want my calculations to be done in real time and plotted. Would Java be the best language for this?
I can do the calculations in C++ but I don't know how to put the plots on the website. Likewise I believe I can do everything in Matlab but the conversion looks a little sketchy.
I would be very thankful if someone with experience with Java, or I also heard python, would comment on my post.
My advice: write the website code in Python with Django and do the calculations in Numpy/Scipy. Those two libraries provide a very Matlab-like API for heavy computations. Their performance is excellent. Matplotlib is the associated plotting library.
It's not so much the language that matters, it's making sure that you have a good mathematics library for it. MATLAB is neat because it does all that matrix math super fast for you, but of course you need to link it with another language like you said.
Your goal should be to either find a good math library for the language you like, or find a language with a good math library you like.
For What It's Worth: I know Python has NumPy (scientific computing package) and Sage Math (a libre Mathematica clone).
I think you can use PHP or Java Web.
I would do C++ and write them to a database, then using php you can grab them from the same database and show them online, otherwise then java can do all that but make sure all calculations aren't done on the fly since that will kill your server, especially with stocks that can turn into a lot of data.
if you want to plot data, then you may be able to pass off some of the calculation to the google chart api:
http://code.google.com/apis/chart/
As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance.
Closed 10 years ago.
I would like to build an Android application that, via an OCR library, should scan a picture extracting text from it .
What Java library should I use?
Don't know how good it is (it definitely needs to be trained first), but there is Ron Cemer's Java OCR library.
If you are looking for a very extensible option or have a specific problem domain you could consider rolling your own using the Java Object Oriented Neural Engine.
I used it successfully in a personal project to identify the letter from an image such as this, you can find all the source for the OCR component of my application on github, here.
try tesseract,
checkout this article
http://www.itwizard.ro/interfacing-cc-libraries-via-jni-example-tesseract-163.html
and this example
http://code.google.com/p/mezzofanti/
Edit:
some more facts
- tesseract is one of the best open source OCR used by google
- there is training data available for many languages
- mezzofanti is an android app that uses tesseract
- beware: OCR does use a lot of CPU power. trying to OCR a A4 page with your T-Mob G1 will take a lot of time and the result may not impress you ;-)
You can use the OCR feature from Google Docs.
Check the Documents List Data API http://code.google.com/apis/documents/docs/3.0/developers_guide_protocol.html#OCR