Japan Mobile Company Debuts Real-Time Voice Translation App


Recommended Posts

translate-app-rsz-1351712798327.jpg

Language barriers are starting to crumble. This month Japan's dominant mobile phone operator, NTT DoCoMo, introduced the world's first app for real-time voice translation. When a user with a DoCoMo smartphone places a call through the app, he speaks in Japanese and his words are promptly translated into English, Mandarin, or Korean. To complete the conversational circuit, the other person's words are translated from any of those languages back into Japanese.

With this debut we've taken one step closer to building a mechanical Babel fish, the extraordinarily useful creature imagined by Douglas Adams in The Hitchhiker's Guide to the Galaxy. As any lover of sci-fi knows, the Babel fish is a leech-like critter that is inserted into the ear and lives in the brain, where it feeds on brain waves and provides simultaneous translation of any language in the universe. NTT DoCoMo's app can't match that universal utility with its current limit of four languages?but at least you don't have to slip something slimy into your ear to make it work.

AT&T's research lab showed off its own translation service earlier this year, but NTT's is further along and seems better integrated into the phone call itself.

The free DoCoMo app relies on the cloud for the heavy processing, namely speech recognition, machine translation, and voice synthesis. According to a NTT DoCoMo newsletter (not online, sorry), the app's reliance on the cloud allows for unobtrusive upgrades and the most important feature, near-instant translation:

Trials have shown that the average processing time takes just about two seconds, fast enough for a reasonably natural conversation under the most unnatural of conditions, i.e., two people conversing easily without understanding each other?s language!

To test the app, the company gave out a beta version that handled Japanese and English to tourist facilities, retail companies, and hospitals. NTT DoCoMo says the trial app had about 90 percent accuracy in recognizing Japanese words, and about 80 percent accuracy in recognizing English words.

The company didn't say how accurate or artful the translations of those words were, though, so I asked for a demonstration. Spokesman So Hiroki graciously complied, and on Tuesday evening my desk phone rang. When I picked it up, a recording told me that this was an automated translation call, and that I should press 0 to continue. Then I heard a man say "Moshi moshi," a gentle chime, and then a soothing woman's voice (not unlike the lady who lives inside many car navigation systems) say "Good evening!"

I quickly discovered that the system is great at pleasantries, not so great at more complicated communications. At one point I asked Hiroki and his colleagues on the call which languages would be added to the system next. The English answer I got back: "It is European edition such as French and German to challenge next."

Still, it was an impressive demonstration, and the team declared their determination (in grammatically correct and understandable English!) to improve translation precision. According to Hiroki, NTT DoCoMo spent two years developing this service because they're looking for ways to fight an alarming trend for the telecom industry: the rapidly declining rate of voice calls.

In another mode, the app can also be used when two people meet face-to-face: They speak their respective languages, and the app provides both voice translation and text on the phone's display screen.

http://spectrum.ieee.org/tech-talk/consumer-electronics/portable-devices/japan-mobile-company-debuts-realtime-voice-translation-app

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Google's new hand-wave reCAPTCHA can be bypassed with a stock photo by Ivan Jenic Image: Screenshot Google is testing a new reCAPTCHA method that asks you to wave at your camera to prove you're human. So, besides solving puzzles and reading distorted text, you can now use your computer’s camera to pass the verification test. When the hand gesture verification is triggered, your browser asks for camera access and prompts you to perform a simple gesture, like a wave or an open palm. Google says it records a short video of the movement and uses AI to extract 21 hand-knuckle coordinates to complete the verification process. The video is then immediately deleted, and Google swears it doesn't keep it. The process alone can be uncomfortable for people who wouldn’t want their biometric data, which hand scans technically qualify as, recorded. But it gets even more nuanced, as early testers discovered that the new hand-waving reCAPTCHA can be passed with a simple stock image. A user on X tested the new challenge using a stock image of a hand fed through OBS Virtual Camera, and it passed. I wanted to verify it, so I tried the same thing. It took me a few tries and a few stock images, but in the end, I was also able to pass the test. I simply had to readjust the stock image of a generic person waving inside OBS, and Google’s mechanism registered it as a legitimate hand gesture. Once again, it didn’t even have to be a video or an AI-generated hand animation. Given the simplicity of the process, the entire action can be automated in minutes. All it takes is a simple Python script to render the new reCAPTCHA method obsolete. And it doesn’t even have to be an AI bot, which is usually used for solving puzzles and other verification methods. The new reCAPTCHA method is still in its early phase, and Google will, hopefully, update its AI to at least reject still images. However, this incident, combined with users’ initial skepticism about Google’s practices regarding user data, likely won’t make too many people wave at the camera anytime soon.
    • 🤣🤣🤣🤣🤣 "to fund healthcare and tuition" 🤣🤣🤣🤣 Who do you think you are talking about, some COMMUNIST? We are better than them, doG bless Murica!!! p.s. I'm from a country where government does exactly that, i.e. not form US.
    • Apparently not. I know it is on Edge for business at the moment, but how long will it be before it become on the home version of Edge?
    • Microsoft details new Edge for Business security features, including AI-powered scareware detection So Edge is adding a "scarecrow." Will it be animated?
    • I have this one and it's great, also paired with a Mac. I like the white back aesthetics of it and ability to have all your wireless usb peripherals under a clean lid. 4K @ 120 Hz and 65W usb-c charging is not bad even at its typical price point. The U series is probably better for commercial photo work though; IIRC one reason this one is priced in a different bracket is because it's not calibrated and verified for optimal color accuracy. Not something I think of in daily use, coding, and light gaming though.
  • Recent Achievements

    • Apprentice
      Asgardi went up a rank
      Apprentice
    • One Month Later
      sunrisea2milk earned a badge
      One Month Later
    • Week One Done
      sunrisea2milk earned a badge
      Week One Done
    • Week One Done
      Snow Day Calculator Alert earned a badge
      Week One Done
    • Conversation Starter
      KMilenkoski1202 earned a badge
      Conversation Starter
  • Popular Contributors

    1. 1
      +primortal
      495
    2. 2
      +Edouard
      251
    3. 3
      PsYcHoKiLLa
      154
    4. 4
      Steven P.
      86
    5. 5
      macoman
      65
  • Tell a friend

    Love Neowin? Tell a friend!