I am using Firebase ML Kit on android studio to capture an image and then detect text.
Currently I take the picture with the phone's camera and this image is displayed in the app. I click my detect text button and the text appears. But I would like to see some bounding boxes on the images that shows what Firebase ML kit is seeing.
I have found many ways to do this in Kotlin, but I am a new developer and I feel like I need to figure out how to do this in Java before completely changing my whole code to Kotlin.
Thanks
ML Kit provides an API to get boundingbox for recognized text:
https://firebase.google.com/docs/reference/android/com/google/firebase/ml/vision/text/FirebaseVisionText.TextBlock
Also, in the quickstart app, ML Kit provides an example about how to do it:
https://github.com/firebase/quickstart-android/blob/master/mlkit/app/src/main/java/com/google/firebase/samples/apps/mlkit/kotlin/textrecognition/TextGraphic.kt#L32
Related
I am trying to build an Android app with Android Studio(Java), that based on a tensorflow model(pretrained in python) is able to classify the emotion of the person standing in front of the device's camera. My app needs to take the frame, apply some opencv operations like facedetection, grayscale, resize, crop, and histogram equlisation, and after that, giving the result to the model, it will predict the emotion. I have an activity that is showing the camera, but i dont know how to do all this frame manipulation and prediction in the background, when on the phone's screen the preview of the camera and the tags of the predictions, are the only things showing.
I managed to do this on PC very easly, running on the max FPS of the webcam, so i think it should be possible on the phone too.
Is there any similar project that i can get inspiration from? or does anyone know a bit of documentation or a way to do something like this ?
I would suggest checking out the Google ML Vision API's, they can easily be integrated into an Android App and are very accurate. There are many API's like the image labeller and facial labeller. Just check them out here:
https://developers.google.com/ml-kit/vision/face-detection/android
Hope I could help!
I'm new to Augment Reality and not having a compatible device to run examples provided for ARCore. I am having a few questions and want them to clear before going further as I'm getting clear about those through any mean. The app I'm working over is gonna perform the following task.
Detect a logo from a product
Create a 3D model of it using AR
display the generated 3D model at the exact same surface
Here is a sample image captured from a box. I want to display the text and logo in the 3D model.
My Questions
is it possible to display both logo and text as a 3D model or AR
supports images only?
Should I use ARCore or OpenCV or any other to do the task? which one is efficient regarding time and memory to implement?
Maybe it would be a discussion-based question but I am literally unable to find a solution for it.
Thanks everyone!
If you do not have ARCore supported device, you can try Vuforia + Unity instead. Vuforia also supports image recognition and overlay with AR. Check out this tutorial for your use case.
If you still want to use ARCore, you should check out Augmented Images feature. The challenge here if your logo has a good score to be able to work nicely for tracking and overlaying AR.
You can check image quality/score with this tool.
i like to make feature detections using java android studio and in these past few day, i had been stuck on how to detect this kind of image using opencv.
Cause its my first time using openCV and the only document i can read is not really helpfull. I've already convert it to grayscale. The big questions mark is how can i make my apps to recognize the feature of this image.
so,for example i can make the label of this image as "abstractimage", so when i foto some of another image with the same pattern it will be detected as "abstractimage", but the detected image will be using on t-shirt, and random object with the same image based on this feature image. Can anyone help me?
I want to develop custom keyboard like Pie Control, iOS' AssistiveTouch, the things I want to say floating and round shaped keyboard. Here's mock-up written in JavaScript. and now I'm edit some xml in sdk softkeyboard sample. but this input method is occupying bottom area with gray screen. I want remove and transparent this.
Can I create based with sdk sample?
If develop without sdk, what class do I need to use?
Thanks.
Sorry for the terrible image below!
How to read text from image in android app.
Edited:
I want to detect text from image which i have captured from camera in android.
Is it possible or is there any library to read text from taken image.
This is not related to android. If you like to get text contained in image you will need
detect text position in image
perform OCR on found text
This task is not trivial and requires some computing resources. There are some OCR libraries around - like opencv, tesseract etc. I and others also develop pure java opensource solution:
http://sourceforge.net/projects/javaocr/