Researchers Convert Mouth Movements Into Speech 154
andylim writes "According to Cellular News, researchers at Germany's Karlsruhe Institute of Technology have developed a method for mobile phones to convert silent mouth movements into speech. As recombu.com points out, the 'potential for secret conversations just got huge.' You could pass the time by making phone calls from the cinema without disturbing anyone. In noisy places like bars and clubs you could make yourself heard without having to shout."
tap-proof? (Score:5, Insightful)
Re:tap-proof? (Score:5, Interesting)
According to my ASL instructor, lip readers are rarely more than 50% accurate. Which makes me wonder about the alleged capabilities of this software, honestly.
Re:tap-proof? (Score:4, Informative)
According to my ASL instructor, lip readers are rarely more than 50% accurate. Which makes me wonder about the alleged capabilities of this software, honestly.
Hard to say. However, if you want true speaker-independent language recognition ... well, even using voice it's only so-so. On the other hand, if what you want is the ability to issue commands to the computer using a much more limited vocabulary, I'd think you'd have more potential.
Re:tap-proof? (Score:4, Interesting)
Re: (Score:3, Interesting)
You might want to look at this [telegraph.co.uk].
I don't think this technology is THAT new... or that it's that inaccurate.
On a side note, I'm hearing impaired (car engine exploded a bit too close to my head). I *CAN* hear -- and that supplements the lip reading I *DO* do... and asking my friend who is totally deaf (and on AIM as I type this), I think that 50% esti
Re: (Score:2)
So, does it do Busta Rhymes?
Re: (Score:2)
Re: (Score:2)
Trillian, then. I've a few contacts on AIM -- and I have an AIM bot I wrote for work years ago still in use.
Re: (Score:2)
As an example, the lip movements for "nine" and "ten" are exactly the same, and it is pretty difficult to work out which one you intend to say from the context of the conversation as usually both could be equally valid.
Re: (Score:2)
It tends to be overcome with a simple r:
'niner [wikipedia.org]', 'ten' are now completely different lip movements. (Additionally, it breaks a tonal similarity between 9 and 5 up for people who are listening to me.)
This tends to be even easier to disambiguate in context.
I use 'niner' on a regular basis in my line of work, in which I give and receive a lot of numbers over the phone, as well as names and locations.
I'd think that this technology would use a similar method to disambiguate between 5, 9, and 10.
(Of asides and the
Re: (Score:3, Funny)
According to my ASL instructor, lip readers are rarely more than 50% accurate. Which makes me wonder about the alleged capabilities of this software, honestly.
Chat room evidence backs this stat up:
Anytime you asked A/S/L, chances are less than 50% the answer is accurate.
Re: (Score:2)
Re: (Score:2)
I would say it would still it may be kinda tricky.
There is a lot of stuff going on in the mouth other then just your lips.
For example when I say Fudge The Fu sounds is what I do with my lips and teeth dge sound I do with my tongue by moving it from the top of the inside of my teeth, and rolling it back a bit so the tip of my tongue isn't touching.
If I rolled my tough just a further back by a minute amount and kept it I would make a K sound. A lot of this movement would be block by a camera plus the toung mo
Re: (Score:2)
Sorry, but I'm actually completely not interested in languages. I took ASL because it was considered a foreign language by my high school (this was 15 years ago) and I didn't want to bother with anything that had a complicated grammar. You have interested me in Signed English somewhat, though. Even with my passing study I'd noticed that ASL was somewhat simplistic (the Englis
You're missing the point (Score:3, Insightful)
Re: (Score:2)
Or, I don't know, someone using this same technology?
Re: (Score:2)
Bingo. Now instead of eavesdropping being limited to hearing range (and masked by other noise), it's limited by visual range and technology.
Basically, anyone with the technology to use this for private information, should know that anyone else with the technology can listen in and therefor won't use it for private information.
Re: (Score:2)
Um, not if there is a lip-reader in the same room, like a hearing-impaired person.
You could just cover your mouth with your hand so that only the phone can see it?
Re: (Score:2)
Re: (Score:2)
or better, a mobile phone with this software and a telephoto lens.....
it actually reduces security, because consumer grade lip reading-- means even if your phone does NOT have the software, mine might...
Ok, try this one (Score:4, Funny)
I said VACUUM!
Re: (Score:2)
Matching UF comic (Score:2)
http://ars.userfriendly.org/cartoons/?id=20060807 [userfriendly.org]
Re: (Score:2)
Re: (Score:3, Funny)
I would hate this feature, specifically because I find it beneficial to my mental health to be able to mouth things without the person on the phone knowing it. Silently shouting "oh shut up you drooling idiot" when I receive support calls from people who can't find the "any" key is my only socially acceptable outlet for this frustration.
Ja (Score:2, Funny)
Aber Ich kann nicht Deutsch gesprechen.
Re: (Score:2)
Aber Ich kann nicht Deutsch gesprechen.
What's German for "you insensitive clod"?
Re: (Score:3, Funny)
Re: (Score:2)
Re: (Score:2)
Hai, boku mo nihongo o hanasenai.
Re: (Score:2)
Re: (Score:2)
Umm... I'd like the same. As long as it ain't eel.
Re: (Score:2)
Bloederweise kannst Du auch nicht Deutsch sprechen.
Re: (Score:2)
They could... (Score:2)
It begins (Score:5, Insightful)
Dave Bowman: Hello, HAL. Do you read me, HAL?
HAL: Affirmative, Dave. I read you.
Dave Bowman: Open the pod bay doors, HAL.
HAL: I'm sorry, Dave. I'm afraid I can't do that.
Dave Bowman: What's the problem?
HAL: I think you know what the problem is just as well as I do.
Dave Bowman: What are you talking about, HAL?
HAL: This mission is too important for me to allow you to jeopardize it.
Dave Bowman: I don't know what you're talking about, HAL.
HAL: I know that you and Frank were planning to disconnect me, and I'm afraid that's something I cannot allow to happen.
Dave Bowman: Where the hell'd you get that idea, HAL?
HAL: Dave, although you took very thorough precautions in the pod against my hearing you, I could see your lips move.
Dave Bowman: Alright, HAL. I'll go in through the emergency airlock.
HAL: Without your space helmet, Dave, you're going to find that rather difficult.
Dave Bowman: HAL, I won't argue with you anymore. Open the doors.
HAL: Dave, this conversation can serve no purpose anymore. Goodbye.
Re:It begins (Score:4, Funny)
HAL: Without your space helmet, Dave, you're going to find that rather difficult.
Best musical comedy ever.
Re: (Score:2)
Actually, it’s not that difficult, as you can easily survive 30 seconds in open space.
Several people already did it. NASA also has a FAQ about it.
Re: (Score:2)
Actually, it’s not that difficult, as you can easily survive 30 seconds in open space.
Several people already did it. NASA also has a FAQ about it.
Yes I am familiar with the subject. The only human who did it as far as I recall was a guy testing gear in an altitude chamber. His last recollection before being revived was the air rushing out of his lungs.
I agree that it is generally believed that Bowman's jump out of the pod into the airlock is feasible as long as he could pull the lever (closing the door and flooding the lock) within 15 seconds.
I am not sure about your 30 seconds. I don't believe anybody has remained operational for that length of tim
Re: (Score:2, Funny)
Yes because the key ingredient of that whole story was that a computer could read lips. That's the one advancement that made killing all humans possible.
Every year that goes by, that scene in Galaxy Quest where Taggart tells the kid the ship is real and he goes "I KNEW IT!" gets funnier.
Re: (Score:2)
Re: (Score:3, Funny)
I read the name as david bowie (sleep deprivation is awesome) I kept waiting for the punchline
HAL: Now it's time to leave the capsule if you dare.
David Bowie: I'm stepping through the door, and I'm floating in the most peculiar way.
I'm coulrophobic, you insentive clod! (Score:5, Insightful)
I don't think I can ever get used to seeing the streets full of mimes.
Re: (Score:2)
wow, i hadn't even thought that far ahead. that creeps me right the fuck out. thanks.
Brings a whole new meaning to (Score:2)
Re: (Score:2)
You're right, it's changed it from being able to work out what someone is saying by simply watching their lips move to ... err ...
I wasn't buying it ... (Score:5, Insightful)
And I was just waiting for that sign, well hidden somewhere in the article, that this is just some beta concept that will stay as such forever.
And then I found the photo of two guys with shitloads of cables attached to their faces.
There's a huge difference between "cellphones convert mouth movements into speech" and "Guy with shitloads of cables on his face tracks the movements of his mouth muscles using 4 unix servers running a processor intensive application with an accuracy of 25%"
The whole thing has nothing to do with cellphones. It's just yet another muscle tracking system, but used on the mouth instead of the hands, and tied to a TTS engine.
Re:I wasn't buying it ... (Score:4, Insightful)
And I was just waiting for that sign, well hidden somewhere in the article, that this is just some beta concept that will stay as such forever.
And then I found the photo of two guys with shitloads of cables attached to their faces.
There's a huge difference between "cellphones convert mouth movements into speech" and "Guy with shitloads of cables on his face tracks the movements of his mouth muscles using 4 unix servers running a processor intensive application with an accuracy of 25%"
Yeah, you're right. We've never gone from a situation where we've had shitloads of hardware and cables, and been able to reduce that down to mobile devices. What were those researchers thinking? Dolts!
(p.s., can you give a link to the "shitloads of face cables" story? Thanks!)
Re: (Score:3, Informative)
Here is the link: http://www.kit.edu/english/pi_2010_767.php [kit.edu]
It's right there in the article ...
OTOH, off course we've been able to reduce the size and cabling of many inventions, but for others, it's impossible. Basically, when the technique itself involves cabling ...
What I mean is: Sure, we've been able to reduce electrocardiograms from huge mechanical machines with shitloads of cables to small devices connected to a computer and only 5 cables, but it still involves connecting cables into your chest, and
Re: (Score:2)
It wasn't that long ago that the same level of complexity was involved for locating faces in video (complete with UNIX servers, think SGI). Now handheld digital cameras can do it in real-time.
Proof of concepts like this are the first steps. Then research
Re: (Score:2)
That is PRECISELY what I'm saying. In the future, other research, might turn this into a wireless technology, it might improve, and in probably 15 years we might have a better application. Then, in another 5 years, it might be applied to cellphones.
So, RIGHT NOW, it has NOTHING to do with cellphones. So, what I said, is accurate.
I want my phone call (Score:2, Funny)
Tell me, Mr. Anderson... what good is a phone call... if you're unable to speak?
Re:I want my phone call (Score:4, Insightful)
The machines are always one step ahead of us.
Psssshhhttt. Losers. (Score:4, Interesting)
Re: (Score:2)
Any serious geek has one of these. [thinkgeek.com]
Why so serious? :P
Ok, sorry, back to the suitcase
Re: (Score:2)
Hmm... are they available for other phones too or just the iToy?
Making phone calls at the movies (Score:2, Insightful)
It's dark in most cinemas. Will the phone contain a light to shine on your face to annoy the sucker behind you? People txting in theatres annoy me too.
Honestly, I HATE it when submitters need to think of an example, and then come up with a shit one. You're better off with no example that thinking of the first crap that comes into your head!
Cinema? (Score:5, Insightful)
No, never and fuck off come to mind. Using a mobile phone in a cinema is one of the least considerate things anyone can do, they create light pollution distracting other patrons from what they are paying for and are absolutely not needed (the exception, emergency staff on call, and they usually just leave their phone on vibrate + silent) let alone any audible noise from them, can't you seriously just disconnect for an hour?
In short, No.
In long, Nooooooooooooooooooooooooo-ooooooooooooooooooooooooooooooooooo-oooooooooooooooooooooooo :)
Also in USA at least its illegal (federal law) to operate any video recording device in a cinema.
yes, blatant ZP rip-off but its needed.
Re: (Score:3, Interesting)
Using a mobile phone in a cinema is one of the least considerate things anyone can do, they create light pollution
One could make such a phone with a 'dark mode' and equip it with IR illuminators and camera.
Re: (Score:2)
Yeah, but your at this problem from the wrong end, I think it would be better to make more films you would want to turn your phone off to enjoy undisturbed for an hour or two.
Re: (Score:2)
I think it would be better to make more films you would want to turn your phone off to enjoy undisturbed for an hour or two.
There's no hope of that happening. Jenna Jaemeson retired.
Re: (Score:2)
Um, yeah: IR blazing out of your phone AND an activated camera. Good luck explaining that one as you are being chucked out of the cinema/hauled away for 'filming the show'.
Re: (Score:2)
Re: (Score:2)
non german/american performance? (Score:1)
Re: (Score:1)
That makes sense to me as both typically involve the same lip movements. However in context of a word (or even an entire sentence), I would imagine you should be able to make a fairly decent guess at which it is.
Re:non german/american performance? (Score:5, Interesting)
Re: (Score:3, Funny)
So if they went into politics they'd be lying through their teeth?
(sorry, couldn't resist)
Re: (Score:2)
That's why when I go to a Thai restaraunt I never order fried rice. I order cowpot instead ("cowpot" is Thai for "fried rice")
Old grade school trick... (Score:5, Funny)
I seem to recall that mouthing "vacuum" and "f*ck you" look the same.... ah the joys of being 10...
Re: (Score:2)
Re: (Score:2)
'colourful' works better.
Re:Old grade school trick... (Score:4, Interesting)
I had to lip-read for 13 years before I got my cochlear implant. If we were talking about such things as vacuums, I'd probably understand what you were saying. If it were just out of the blue, though, yeah, I might misunderstand and punch you for it. In fact, it was considered great fun to say things to me and have me repeat them as I understood them. "Pear pimples for hairy fishnuts?" Stuff like that.
Works with hearing people too... (Score:2)
Re: (Score:2)
I often smile at people on the road and say "Can you read lips, cunt?"
Yes, I am a bad, bad person.
HAL! Can you hear me now? HAL!? (Score:1)
I Can Hear It Now (Score:4, Funny)
Can you steer me how?
Can you beer me cow?
Clan ewe fear be now?
</Stephen Hawking Voice>
Re: (Score:2)
I think Steff gave up on the voice recognition. How can you tell? The screen says "Cod am pizza ship".
Impressive (Score:3, Insightful)
Re: (Score:2)
Especially when you consider the number of people who constantly move their mouths and say nothing.
More fun when you think of the things people mutter only to have the said out loud for you now. 'Stupid son of a... WAIT I didn't mean for it to say that'
Please Don't Open Your Phone in the Theater (Score:3, Insightful)
Re: (Score:3, Funny)
Re: (Score:2)
This sounds like work being done by NASA (Score:3, Informative)
Here are some previous
http://science.slashdot.org/article.pl?sid=04/03/18/0132222 [slashdot.org]
http://tech.slashdot.org/article.pl?sid=05/04/10/1417250&tid=215&tid=14 [slashdot.org]
jdb2
Another use: (Score:1)
15 years ago, with desktop workstations (Score:5, Interesting)
Fifteen (!) years ago, I took a UC Extension class on Neural Networks taught by Stanford professor David Stork. He had developed a lip-reading system for communication in noisy environments, such as an airplane-repair facility. If you could do it 15 years ago with workstation-class desktops, I suppose you could do it with a smartphone today.
Re: (Score:2)
Is there an app for that?
Re: (Score:2)
Sure, the resulting synthesized voice has to be sent as an MMS message, though, due to delays in speech recognition. Oh, wait...
Other uses (Score:3, Interesting)
"In noisy places like bars and clubs you could make yourself heard without having to shout."
Or more likely, used by men in conjunction with Babel Fish [yahoo.com] to chat-up women who don't speak English.
Re: (Score:2)
do you have any pamphlets? did you secure vc yet? you need to get on this, man; you're sitting on a goldmine!
No you can't phone from the cinema (Score:2, Insightful)
You could pass the time by making phone calls from the cinema without disturbing anyone.
NO!
It's not only the noise that you make talking; it's also the light from the phone.
Use it to enhance traditional voice recognition (Score:2)
Passing the time? (Score:3, Insightful)
You could pass the time by making phone calls from the cinema
I've always thought that the best way to pass the time in a cinema is to watch the fucking movie.
Have you ever seen a vaccuum? (Score:2)
Yeah, just ask someone to say that while you try to read their lips...
Sounds great, but . . . (Score:2)
How will I mutter under my breath about what an idiot the person I'm talking to is?
Subvocalization (Score:2)
What an excellent way for me to stay in touch with my friend Jane.!
BS - the reverse is true (Score:2)
Normally it takes some talent or a directional mike to pick up a distant conversation, these guys would have just automated long distance bugging. All you need is a decent telelens. It means any boardroom conversation will now require closed curtains.
Too quick.. (Score:2)
.. that is, of course, after they get rid of the need for reading muscle tension by electricity. That is a matter of optical analysis so I guess that will be step 2.
Side note: I am very wary of devices requiring direct electrical contact with my body..
Potential for secret conversations (Score:2)
Just went down the tubes, if this is real. All i have to do is point my phone towards that guy across the room and 'hear' everything hes saying trying to convey.
Accuracy will suck (Score:2)
Accuracy will suck, even with a trained human.
Hence a running joke in the Deaf community about the saleswoman peddling beauty aids with "Olive Juice". (it lipreads as "I love you" ) I believe it was a "sunshine II" skit. Yeah, I was an interpreter for like 10 years.