Language best for Text Processing [closed] - java

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 9 years ago.
Improve this question
I am working on Text Processing Project. Where I need to do many operaition on text provided to me. Like finding substring in the text (Using Regex), deleting garbage text (Using Exact or Regex match), Splitting Text into multiple parts. Currently I am using Java for this purpose. Just want to understand is Java most suitable language for such things or I should use some other language.

It's opinion based, but I'd say one language is as good as another for this. There are some great java libraries for text parsing. I've been using the Guava Joiner, Splitter and Charmatcher classes a lot recently for similar things. https://code.google.com/p/guava-libraries/

Related

Can I use a program via java? [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 5 years ago.
Improve this question
My question is simple:
Is it possible to use a program via Java? Meaning, can I for example code in java exactly what I want my Microsoft Word file do? Add paragraphs, lists,import a picture, etc?
Short answer: Yes. Some programs allow you to use scripts and such, that is the beauty of open source software.
To actually edit the document you can use Apache POI which is an API for Microsoft Documents. Granted it cannot handle .docx but simple text documents. You may also want to take a look at docx4java which supports .docx, .pptx and .xlsx files.

choosing language to work on very large text files (up to some terabytes) [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 8 years ago.
Improve this question
I am working on a project which uses text files (.txt) for input, reading them line by line but this files can go as large as 1 terabytes. I know some languages/technologies which I used for similar problems, those are Java, Bash, Awk, and Python. But I don't know which one can work with such large file, and what kind on tricks and tweaks will be needed.
As long as you process your file line by line and you assemble some statistics, it doesn't really matter what tool you choose.
Java has some advantage in terms of speed, compared to scripting languages, but in the end it will be a difference only by a constant factor. What matters the most is the algorithm that you use to process the file.

JSON Parser Project [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 8 years ago.
Improve this question
I'm thinking about starting an open source JSON parser library in Java for fun and coding practice. Would this be useful for people? Would this have more adoption than an XML parser?
There has always been a fair amount of backlash against XML, but this has really heated up since JSON became popular. A lot of people want to throw XML out the window and use JSON for everything, and for many problems, this is a reasonable thing to do. JSON and XML have many similar properties, but XML has some tricks up its sleeve that JSON can't yet match. you can find more useful answer in the below JSON and XML comparison and https://stackoverflow.com/questions/3536893/what-are-the-pros-and-cons-of-xml-and-json

Best way to parse some HTML code in Android [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 9 years ago.
Improve this question
I'm currently developing a small application for my parents' restaurant. The restaurant has a webpage where the clients have a private zone where they can see the status of their account after they log in.
Let's assume I log in succesfully and now I can see that private HTML after login. There, i'd like to parse some data.
My question is, I think, quite simple. I know I can use JSOUP for this task but I can also manage to do it by using Strings and Substrings.
Which method is best and why? I've looking for a while and I can't decide which to use.
Thanks in advance,
Alex
Jsoup is a better method, it is a library built for html parsing.
Using strings and substrings can be very error prone and lead you to writing a lot of code for something an html parsing library can already do easily.

Proguard alternative for obfuscating android apps using precompiled third party libraries [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 9 years ago.
Improve this question
I have tried ProGuard many times and spent a lot of time on configuring it to no avail in the past year or so. It seems to have some inherent limitations related to pre-compiled third-party libraries as explained in its document. ProGuard document nicely provides a long list of alternatives. Instead of trying them one by one, I am here to see if any could share successful experiences with some of them. I only need something to obfuscate java code. Code shrinking and optimization are nice to have, but not required.

Categories

Resources