Capturing incoming audio from Ekiga softphone/Pulseaudio JACK sink - java

I want to do speech recognition using Sphinx.
I'm looking to capture the output/incoming audio of the Ekiga VOIP softphone using Java or Python and pass it on to Sphinx. Right now, the output is directed toward the PulseAudio JACK Sink.
Sorry if I am not using the right terminology as I am quite a noob at this.
If you could point me in the right direction, it would be much appreciated, thanks.

You need to properly formalate the task you want to achive and select the right tools for that. Software developers do not use desktop applications like Ekiga for their projects. It's just because desktop applications are written for the users, they are not for software developers. You can only modify desktop applications like Ekiga in the source code, but that is a whole different story.
If you want to have a VOIP endpoint and connect it to a speech recognition service, you need to look on IVR systems and similar things. IVR system is a specific tool to implement complex interactions over the VOIP protocols. Some examples are:
Freeswitch
Asterisk
GNU Gatekeeper - a project is built on the same Opal technology as Ekiga
You can setup freeswitch to wait for the calls and pass them to the pcoketsphinx. You can do the same with Asterisk. For more details see
How to use Pocketpshinx from Freeswitch
How to integrate Pocketsphinx in Asterisk using UniMRCP project

Related

Emulating Android Drone Control App On Windows

I recently purchased a Yuneec Breeze quad-copter drone that uses an Android and iOS app with its own WiFi connection as a controller.
I am looking to write a Java application that performs the functions of the Android app with Game Controller support (the Breeze Cam app uses the touchscreen which means my big thumbs cover the streaming video) but am running into a roadblock.
While being proficient in Java, including network functions, I find myself woefully uninformed as to how to capture and read the network traffic between my phone and my drone. Its simply a skill I never needed until now.
Any place to help me get started?
OR -
Since I'm writing this in Java would de-compiling and hacking away at the Android source be a more viable option?
If so what decompilers are recommended?
In my experience these apps tend to be written in proprietary code. Android may be an open-source project, but the apps written for it don't have to be. It's unlikely you'll be able to write your own code to control the drone unless Yuneec provides an API/SDK to do so.
A quick Google search yields this thread on their forums: https://yuneecpilots.com/threads/api-sdk.745/
It's not much info, but it looks like you're SOL. Sorry.

Is it possible to use MARY TTS engine on android application? Has anyone attempted this? Where would be a good place to start?

I'm interested in implementing the MARY TTS engine on Android, specifically
to use some of the advanced features available to the voices (such as
inflection, duration, pitch, etc). Ideally I would create a self-contained
Android application with a front-end (not MARY) to handle the text input
and generation, and a back-end (is MARY) to handle the Text-to-Speech task
and pass the audio files back to the front-end.
Has anyone attempted this? Where would be a good place to start?
We have already start to solve this task with integrating MaryTTS java code to Android. Can tested from here
https://github.com/AndroidMaryTTS

Java Voice Biometric

I want to develop an application based on Voice Biometric Recognition.
Specifically, I want to develop an application which will record a voice from the telephone, and identify the speaker. If the same person calls again it will recognize the voice. Like other Biometric applications do here my need is to do a voice biometric. Are there any URLs or examples which will help me. I searched but not able to find a solution.
FreeSpeech is a text-independent speaker verification system that verifies a caller's identity
I want to achieve the above one FreeSpeech Recognition in my application.
Is it possible to do the below things by using any Open Source.
The individual records a voice print, then
The system keeps track of the voice prints and can distinguish recordings from live speech
If yes, can you please provide me a URL or example which will help me.
Well, I got the light from This Url to achieve the above task but not able to get the expected out put.
After wasting 20 to 25 Hrs, Finally I got the solution by using MARF Framework.
I got the sample app from the http://sourceforge.net/projects/marf/files/Applications/%5Bf%5D%20SpeakerIdentApp/0.3.0-devel-20060226/
And for now, it's working fine for me. This links is very useful for me to make the sample app executable. http://marf.sourceforge.net/
You can take a look at this previous SO post in which various Java Speech Recognition Engines are described such as Sphinx.
I am not an expert on this domain so please take my answer as is , it's not an authorative one... I think you have different ways to achieve your goals :
- finding a Java library is one , the most natural one
- recording the voice in Java then applying one of the several algorithms available for such job , you may find many research papers dealing with that subject
- depending from the architecture choices, you may find different libraries implemented in C dealing with voice signal, using JNI or JNA is one way to deal with C/C++ libraries, Web Services or CORBA are other ways to achieve this....
HTH
Jerome

Automatical translation in desktop application

I run a chat service where people of all nationalities are welcome. The problem is that they can't communicate. I don't have enough users to create different chat rooms for each language.
I'd like to provide an automatical real-time translation integrated in the chat, which is programmed in plain Java.
I don't want to hack google translate or do something like that.
What are my options?
There are no options unless paying a lot of money for this service.
There is a Java client API for Google Translate (but it is an unofficial one). You can find it at http://code.google.com/p/google-api-translate-java/

Open-source p2p videoconferencing in Flash or Java?

I want to build a community website with videoconferencing functionality integrated. I would prefer to provide this from within the browser, so I'm looking for a Java- or Flash-based solution.
Also, it would be nice to spare bandwidth by having the clients stream their audio and video data without using a central server (like the way Skype works, for example).
Is there a reasonably mature open-source project that meets these criteria?
When doing Voice over IP, UDP is preferred, and web browser plugins usually lacks support for UDP and P2P-connections.
But there is a solution. You have to use Adobe Flash Player 10 and the Adobe Stratus for this. This is actually what Chatroulette is using for P2P streamed video chat.
Take it from someone who found out the hard way: Java's support for video playback is pisspoor. They keep promising a new, fantastic video playback system, but meanwhile we are STILL stuck with Java Media Framework, the platform-limited, MPEG-4 incompatible mess.
Your alternatives are Quicktime for Java and systems wrapping MPlayer/VLC/etc. None of those are really options for a browser.
Much as I hate Adobe and the evils of Flash site design, Flash is far and away your best option.
See the Xuggler library -- it includes code for playing back video (well, sample code) in the com.xuggle.mediatool.IMediaVieer object.

Categories

Resources