Closed. This question needs to be more focused. It is not currently accepting answers.
Closed 3 years ago.
I have thousands of PDF articles from which I need to extract only the author's name and related details, such as address, email ID, and whatever else is provided in the content of the PDF itself. I don't want to do this through the PDF's metadata: I tried that and ended up with only a few details, such as author name, title, and other usual fields, which I do not need at all.
I have gone through the APIs I could find on the internet, but I still did not get a solution. I need to do it in Java.
I don't think you can get this directly from any library.
Use the iText library to read the PDF. Once you are able to read the text, find the author details using regular expressions.
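The regular-expression step can be sketched roughly as follows. This is only a minimal illustration, assuming the page text has already been extracted by a PDF library; the class name `AuthorDetailExtractor`, the sample text, and the simplified email pattern are all assumptions made for the example, and real articles will need patterns tuned to their layout.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class AuthorDetailExtractor {
    // Simplified e-mail pattern (an assumption); it does not cover every valid address.
    private static final Pattern EMAIL =
            Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");

    /** Returns every e-mail address found in already-extracted text, in order of appearance. */
    static List<String> findEmails(String text) {
        List<String> emails = new ArrayList<>();
        Matcher m = EMAIL.matcher(text);
        while (m.find()) {
            emails.add(m.group());
        }
        return emails;
    }

    public static void main(String[] args) {
        // Hypothetical first-page text, as a PDF library might return it.
        String pageText = "A Study of Things\nJane Doe\n"
                + "Example University, 1 Main St\njane.doe@example.edu\n";
        System.out.println(findEmails(pageText));
    }
}
```

Names and postal addresses have no fixed shape, so they usually need heuristics based on position (for example, the lines just above the email) rather than a single pattern.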
Closed. This question needs to be more focused. It is not currently accepting answers.
Closed 5 years ago.
My question is simple:
Is it possible to drive a program from Java? Meaning, can I, for example, code in Java exactly what I want my Microsoft Word file to do: add paragraphs, lists, import a picture, etc.?
Short answer: Yes. Some programs allow you to use scripts and such, that is the beauty of open source software.
To actually edit the document you can use Apache POI, which is a Java API for Microsoft document formats. It handles the older .doc format through its HWPF component and .docx through XWPF. You may also want to take a look at docx4j, which supports .docx, .pptx and .xlsx files.
Closed. This question needs to be more focused. It is not currently accepting answers.
Closed 6 years ago.
I have a question.
I want to parse an XML file using SAX or JAXB and save the parsed data into a database using Java. Any help, please?
Even though you should research the topic first, I am giving you a starting point. Basically you'd use unmarshalling to create a Java object from the XML (see http://lstierneyltd.com/blog/development/examples/java-to-xml-xml-to-java-marshalling-and-unmarshalling/). Then you could use an ORM, e.g. Apache Cayenne or Hibernate, to store the object in a database.
I would recommend against storing the raw XML itself in a relational database if you expect significant amounts of data.
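Since the question also mentions SAX, here is a minimal sketch of that route using only the JDK's built-in parser, instead of the JAXB unmarshalling suggested above. The `<people>/<person id="..."><name>` layout and the `Person` class are hypothetical and chosen only for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

class PersonSaxParser {
    /** Hypothetical value object; one instance per <person> element. */
    static final class Person {
        final String id;
        String name = "";
        Person(String id) { this.id = id; }
    }

    /** Parses the XML string and returns one Person per <person> element. */
    static List<Person> parse(String xml) {
        final List<Person> people = new ArrayList<>();
        DefaultHandler handler = new DefaultHandler() {
            private Person current;
            private final StringBuilder text = new StringBuilder();

            @Override
            public void startElement(String uri, String localName, String qName, Attributes attrs) {
                text.setLength(0); // start collecting text for the new element
                if ("person".equals(qName)) {
                    current = new Person(attrs.getValue("id"));
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length); // may be called several times per element
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("name".equals(qName) && current != null) {
                    current.name = text.toString().trim();
                } else if ("person".equals(qName)) {
                    people.add(current);
                    current = null;
                }
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalArgumentException("Could not parse XML", e);
        }
        return people;
    }
}
```

Each resulting `Person` could then be written to the database, for instance with a JDBC `PreparedStatement` or through one of the ORMs mentioned above.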
Closed. This question needs details or clarity. It is not currently accepting answers.
Closed 7 years ago.
I do not know if this is the right place to ask this question, but I have been wondering: what languages can one use to create a file format? I am working on an Android app which reads and writes files, and I want to create a unique file format which only it can read. I know each format uses different algorithms, which is why I am asking if anyone knows any references I can look at. I cannot seem to find any documentation online on this topic.
Any language can be used to create any kind of file, as long as you know the encoding and encryption used by that file format. If you really need a unique type of file for your application, simply save serialized objects, or encrypt the serialized stream before saving it, and use any extension you like.
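As a rough sketch of the "save serialized objects" suggestion, the following uses standard Java serialization. The `Note` class and the `.myfmt` extension are made up for the example, and the encryption step is left out.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

class CustomFormatDemo {
    /** Hypothetical application data stored in the custom file. */
    static final class Note implements Serializable {
        private static final long serialVersionUID = 1L;
        final String title;
        final int priority;

        Note(String title, int priority) {
            this.title = title;
            this.priority = priority;
        }
    }

    /** Writes the note to the given file as a serialized object stream. */
    static void save(Note note, File file) {
        try (ObjectOutputStream out = new ObjectOutputStream(new FileOutputStream(file))) {
            out.writeObject(note);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Reads a note back from a file previously written by save(). */
    static Note load(File file) {
        try (ObjectInputStream in = new ObjectInputStream(new FileInputStream(file))) {
            return (Note) in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException("Unexpected file content", e);
        }
    }

    public static void main(String[] args) {
        File file = new File(System.getProperty("java.io.tmpdir"), "example.myfmt");
        save(new Note("Buy milk", 2), file);
        System.out.println(load(file).title);
    }
}
```

Note that the extension alone does not stop other programs from reading the file: a serialized stream can be read by any Java code that has the class, so the encryption mentioned above is what would actually restrict access.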
Closed. This question is opinion-based. It is not currently accepting answers.
Closed 9 years ago.
I'm currently developing a small application for my parents' restaurant. The restaurant has a webpage where the clients have a private zone where they can see the status of their account after they log in.
Let's assume I log in successfully and can now see that private HTML page. There, I'd like to parse some data.
My question is, I think, quite simple. I know I can use Jsoup for this task, but I could also manage it by using Strings and substrings.
Which method is best, and why? I've been looking for a while and I can't decide which to use.
Thanks in advance,
Alex
Jsoup is the better method: it is a library built for HTML parsing.
Using strings and substrings can be very error prone and lead you to writing a lot of code for something an html parsing library can already do easily.
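To illustrate why the substring approach is fragile, here is a small sketch using only the standard library. The `<span id="balance">` markup is a hypothetical stand-in for the restaurant's private page.

```java
class SubstringFragility {
    /** Naive extraction: assumes the markup is exactly <span id="balance">...</span>. */
    static String balanceBySubstring(String html) {
        String open = "<span id=\"balance\">";
        int start = html.indexOf(open);
        if (start < 0) {
            return null; // opening marker not found
        }
        start += open.length();
        int end = html.indexOf("</span>", start);
        return end < 0 ? null : html.substring(start, end);
    }

    public static void main(String[] args) {
        // Works while the markup matches the hard-coded marker exactly.
        System.out.println(balanceBySubstring("<p><span id=\"balance\">12.50</span></p>"));
        // Fails (returns null) as soon as an attribute is added to the same element.
        System.out.println(balanceBySubstring("<span class=\"amount\" id=\"balance\">12.50</span>"));
    }
}
```

An HTML parser looks the element up by its id rather than by the exact tag text, so a change like the added `class` attribute would not break it.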
Closed. This question needs details or clarity. It is not currently accepting answers.
Closed 8 years ago.
I want to get the exact video link from streaming video sites like YouTube or metacafe.com, since no direct link to the video is available. How can I parse out the video link? Also, how can I use Java to download a streaming video from an HTTP URL to the local system?
Thanks!
I have used a library called HTMLParser before. It can read HTML input and generate HTML nodes. You can then analyze the nodes and retrieve the ones you need.