Here is a video of Siri and TellMe in use side-by-side


Recommended Posts

Why should I have to adjust everything I know in order to use a tool? Surely a tool and its subset of features should be designed around the way we speak?

Again, if you'd read the thread you would see that this has already been discussed. "Text ccuk", "Call ccuk", "Find ccuk" are all perfectly natural things to say so you're hardly being forced to compromise. Anyway, whether you think natural language is important is irrelevant, TellMe doesn't offer it and he shouldn't try to use the software incorrectly. It's the equivalent of complaining that you can't view Twitter feeds in the iPhone's contacts application - iPhone doesn't offer this feature so there's no point claiming that it's broken.

TellMe performs at least some of its voice recognition on the phone. I've just turned off all data connections on my phone and TellMe can launch applications, call and text contacts. The only thing it can't do is search using Bing.

That's more than can be said for crappy vlingo on android at least :p driving along the motorway with a skittish data connection renders it nearly useless.

I know you've already conceeded that the recognition isn't as good but that is exactly the reason for the failure in this video, it's not because he used invalid commands.

I think it is based on my experience of TellMe. Recognition is better when I use the correct commands or use natural language in the right place. If I say "Text Jake" it will generally text me. However, if I say "Send a text to Jake" it fails every time.

Again, if you'd read the thread you would see that this has already been discussed. "Text ccuk", "Call ccuk", "Find ccuk" are all perfectly natural things to say so you're hardly being forced to compromise. Anyway, whether you think natural language is important is irrelevant, TellMe doesn't offer it and he shouldn't try to use the software incorrectly. It's the equivalent of complaining that you can't view Twitter feeds in the iPhone's contacts application - iPhone doesn't offer this feature so there's no point claiming that it's broken.

See you keep dodging around points. Regardless of what you are saying the system couldn't understand the words he was speaking, context aside. I personally don't think it's as good as other offerings.

My WP7 works much better than that. Holy ****, I mean it's no Siri, but that guy was just terrible.

I wonder how long MS spent on research and development for accents which aren't American.

Also, can TellMe work with foreign languages?

I think it is based on my experience of TellMe. Recognition is better when I use the correct commands or use natural language in the right place. If I say "Text Jake" it will generally text me. However, if I say "Send a text to Jake" it fails every time.

You're probably right, but that seems like a crazy way of doing things. It shouldn't assume the correct command will be used every time as it's being spoken to by humans not machines. It should decipher the sentence and then decide if a correct command is used, not decide there's no valid command and crap out.

I wonder how long MS spent on research and development for accents which aren't American.

Also, can TellMe work with foreign languages?

Yes, TellMe works with foreign accents. But, TellMe is NOT directly comparable to Siri. Siri is a complete solution that listens to speech with context and performs and action. TellMe itself is just voice recognition, with no context. Context is provided by the application developer using the TellMe service. It'd be perfectly easy to create something similar to Siri using TellMe. For example, people praise Kinect's voice recognition - but that's also powered by the same TellMe service, but just with more context provided by the actual software on the Xbox.

And of course, what makes speech recognition works properly is context. Non-contextual speech recognition is never mind blowing, and it's what lets Siri distinguish between "10 am" and "teen anal". Siri is expecting a time to fit in with the rest of the sentence, so it assumes a time. Note that voice recognition doesn't usually return exact fixed words, they tend to return words accompanied by liklihood probabilities - i.e. it's 90% likely to be this, but maybe 70% sounds like this. With proper context in Siri's case it does better because it can looks at those at all those choices, and then look at the types of commands it accepts, and be able to choose appropriately. If Windows Phone was programmed to let you make appointments over speech, it would work fine. But it's not, so it doesn't.

In the end, Siri is programmed to recognise a lot more, Microsoft's Windows Phone speech is only programmed to recognise a tiny subset of commands, nothing to do with problems on TellMe's ends - it's just the say it's been programmed on Windows Phone. Hell, even if one company knows another companies implementation is better, they're not going to be stupid enough to tell people their competition is better are they @___@ Microsoft are hardly going to toot Apple's horn ever, and neither are Google :p

You're probably right, but that seems like a crazy way of doing things. It shouldn't assume the correct command will be used every time as it's being spoken to by humans not machines. It should decipher the sentence and then decide if a correct command is used, not decide there's no valid command and crap out.

A crazy way of doing things? Why? You're just telling TellMe to do something (command) and not having a conversation (natural human speech) with it. In my opinion, "text John" is a lot clearer, precise, and shorter than "Can you please send a text to John?" Don't get me wrong, Siri's speech abilities are good and based on this video, it's natural speech recognition is great but TellMe is not as bad as the video makes it seem as it just needs to be updated.

See you keep dodging around points. Regardless of what you are saying the system couldn't understand the words he was speaking, context aside. I personally don't think it's as good as other offerings.

I'm not trying to dodge around anything. You are just making points without having bothered to read the rest of the thread which is leading to confusion. Read what I've said before trying to pick my posts apart.

A crazy way of doing things? Why? You're just telling TellMe to do something (command) and not having a conversation (natural human speech) with it. In my opinion, "text John" is a lot clearer, precise, and shorter than "Can you please send a text to John?" Don't get me wrong, Siri's speech abilities are good and based on this video, it's natural speech recognition is great but TellMe is not as bad as the video makes it seem as it just needs to be updated.

You need to read the comment I replied to to see the context. Jakem1 was suggesting the entire sentence was misread because no correct command was used. He said the voice recognition is more accurate when correct commands are used.

I said that is a crazy way of doing things, because the way the software recognises speech shouldn't change based on whether the correct command is used or not. It's expecting human input so occasionally incorrect commands will be used and the software should account for that.

I'm not saying that commands are a bad idea, just that the voice recognition should be equally as accurate whether the correct command is used or not.

You need to read the comment I replied to to see the context. Jakem1 was suggesting the entire sentence was misread because no correct command was used. He said the voice recognition is more accurate when correct commands are used.

I said that is a crazy way of doing things, because the way the software recognises speech shouldn't change based on whether the correct command is used or not. It's expecting human input so occasionally incorrect commands will be used and the software should account for that.

I'm not saying that commands are a bad idea, just that the voice recognition should be equally as accurate whether the correct command is used or not.

Siri is still command based - it just features a lot more commands.

Unfortunately most current voice recognition systems ARE based on context. They don't tend to return exact matches of words. - they return a list of words and probabilities, and then the software forms a sentence using the probabilities provided by the speech recognition service, and the context and sentence structures the software supports. Speech recognition isn't advanced far enough to perfectly understand speech with no context. Siri is still ultimately command base, it's just more likely to get it right because it's implicitly programmed to recognise more variations of the same thing.

Siri is still command based - it just features a lot more commands.

Both Siri and TellMe can be used for transcribing. Siri is system wide, TellMe can be used inside the Text Messaging app. How accurate are both systems when doing pure transcription? I know from using Siri it's about 98% correct. How is TellMe though? If it's anything like when trying to tell it to use commands I'm not impressed.

Both Siri and TellMe can be used for transcribing. Siri is system wide, TellMe can be used inside the Text Messaging app. How accurate are both systems when doing pure transcription? I know from using Siri it's about 98% correct. How is TellMe though? If it's anything like when trying to tell it to use commands I'm not impressed.

It works perfectly well for me. The only things it messes up for me tends to just be odd names, but apart from that I've had no problems with it.

For what it's worth - Siri's use of Nuance is directly comparable to Windows Phone's use of TellMe for speech synthesis. So it's really Nuance speech synthesis vs TellMe speech synthesis for pure transcription, rather than Siri or Windows Phone :p

You need to read the comment I replied to to see the context. Jakem1 was suggesting the entire sentence was misread because no correct command was used. He said the voice recognition is more accurate when correct commands are used.

I said that is a crazy way of doing things, because the way the software recognises speech shouldn't change based on whether the correct command is used or not. It's expecting human input so occasionally incorrect commands will be used and the software should account for that.

I'm not saying that commands are a bad idea, just that the voice recognition should be equally as accurate whether the correct command is used or not.

I agree with you as I did read the comment and understand why you believe the software should account for any speech inconsistencies. You're right, speech software should understand both but to be honest I prefer the command way of telling my speech recognition software how to do things. However, I love the fact that companies are pushing the software in new directions as it benefits us the consumers.

Siri is still command based - it just features a lot more commands.

Unfortunately most current voice recognition systems ARE based on context. They don't tend to return exact matches of words. - they return a list of words and probabilities, and then the software forms a sentence using the probabilities provided by the speech recognition service, and the context and sentence structures the software supports. Speech recognition isn't advanced far enough to perfectly understand speech with no context. Siri is still ultimately command base, it's just more likely to get it right because it's implicitly programmed to recognise more variations of the same thing.

If that was the case then transcription would suck. It's clear to me that siri is just better..a lot better.

To text you say ... "Text" then the name for tell me.. its so dumb because this guy did not say the commands needed for tell me.. he would say something like.. send this person a text.. instead of just saying text and then the name..

Also how it works is that what ever you say without the commands, is just searched on bing.. i like this because i dont need to be like, "siri, where is the closest indian resaraunt".. I can be like "Indian Restaurant" and bing local shows me.

If that was the case then transcription would suck. It's clear to me that siri is just better..a lot better.

From that video? That video evidently proves how context makes recognition better. Find a video comparing direct voice transcription on a text message where context doesn't exist for a better idea of how they compare on that front.

From that video? That video evidently proves how context makes recognition better. Find a video comparing direct voice transcription on a text message where context doesn't exist for a better idea of how they compare on that front.

No it doesn't..not everything is context based, what about Bing/Google searches? What's the context there, words?? :laugh:

If contextual expectations made that big a difference to any voice recognition software then transcription would suck, but it clearly doesn't.

Siri is incredibly accurate at sending text messages, emails etc where the message can be anything.

Your explanation doesn't wash.

No it doesn't..not everything is context based, what about Bing/Google searches? What's the context there, words?? :laugh:

If contextual expectations made that big a difference to any voice recognition software then transcription would suck, but it clearly doesn't.

Siri is incredibly accurate at sending text messages, emails etc where the message can be anything.

Your explanation doesn't wash.

Did you not see the point where I said "in this video"? Windows Phone and Android do great transcriptions too of text messages, and side by side with an iPhone give largely similar results.

Did you not see the point where I said "in this video"? Windows Phone and Android do great transcriptions too of text messages, and side by side with an iPhone give largely similar results.

The windows phone tried to search bing for something completely different to what was said, and there's not really any contextual guesswork involved with Bing searches..logic dictates transcription would offer the same poor recognition for the man in the video.

The windows phone tried to search bing for something completely different to what was said, and there's not really any contextual guesswork involved with Bing searches..logic dictates transcription would offer the same poor recognition for the man in the video.

I think you may be missing Johnny's point but dictation is more accurate in TellMe than the stuff you've seen in this thread. Check out this video from 56 seconds on for an example of dictation in a text message:

Also, dictation into the Bing app for searching is mostly accurate although it does fall over on some names.

EDIT: It just occurred to me that (as I guess you'd expect) the dictation gets passed to a server for translation so that's one more thing that can't be done without a data connection. It might also explain why it's more accurate than some of the command-based stuff.

I think you may be missing Johnny's point but dictation is more accurate in TellMe than the stuff you've seen in this thread. Check out this video from 56 seconds on for an example of dictation in a text message:

Also, dictation into the Bing app for searching is mostly accurate although it does fall over on some names.

EDIT: It just occurred to me that (as I guess you'd expect) the dictation gets passed to a server for translation so that's one more thing that can't be done without a data connection. It might also explain why it's more accurate than some of the command-based stuff.

Unfair comparison..the person in that video is American. It tends to be other accents these types of software fail on. Show me an Australian and well talk :p

Unfair comparison..the person in that video is American. It tends to be other accents these types of software fail on. Show me an Australian and well talk :p

:laugh:

It's not Australian but this one's a little closer to home for us and it shows off search and dictation in action:

http://www.youtube.com/watch?v=0vW8vE10Snk

The windows phone tried to search bing for something completely different to what was said, and there's not really any contextual guesswork involved with Bing searches..logic dictates transcription would offer the same poor recognition for the man in the video.

Or ergo, logic dictates that the only reason Siri COULD properly understand it is because there was added context. Certainly for the very first one, you can see where context comes in handy. And second is actually phonetically very similar, and context would easily sway a speech recognition system in favour of "Send a text to <contact>" if that's programmed into it's grammar. And same with the third. This video proves nothing, apart from the fact that Siri has a great context engine. (And it is infact a better system than what Windows Phone has, but I'm only defending TellMe, not what Microsoft have done with it in Windows Phone)

The actual underling accuracy of the speech -> text can only be shown in a situation where no context is used, in raw transcription, which that original video didn't. And unfortunately I have no Australians in my house to make a video comparing how they work for Australians :p

Or ergo, logic dictates that the only reason Siri COULD properly understand it is because there was added context. Certainly for the very first one, you can see where context comes in handy. And second is actually phonetically very similar, and context would easily sway a speech recognition system in favour of "Send a text to <contact>" if that's programmed into it's grammar. And same with the third. This video proves nothing, apart from the fact that Siri has a great context engine. (And it is infact a better system than what Windows Phone has, but I'm only defending TellMe, not what Microsoft have done with it in Windows Phone)

The actual underling accuracy of the speech -> text can only be shown in a situation where no context is used, in raw transcription, which that original video didn't. And unfortunately I have no Australians in my house to make a video comparing how they work for Australians :p

http://m.youtube.com/index?desktop_uri=%2F&gl=GB#/watch?v=E91Qu1nVQtE

I can't find a dictation video but this shows the conversation features...very accurate :p

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Microsoft making much needed change to Windows 11, 10 Patch Tuesday security updates by Sayan Sen Recently, Microsoft delivered its latest Defender patches for Windows 11 ISOs. These definitions are released from time to time alongside the general security updates available during Patch Tuesday. Speaking of Defender, the company has now announced another important change that affects how security updates are delivered to enterprise devices running Windows. According to a recent announcement, Microsoft Defender for Endpoint's endpoint detection and response (EDR) updates will no longer be bundled with the monthly Windows security updates or Patch Tuesdays. Instead the company is shifting delivery of these updates to Microsoft Update, bringing EDR servicing in line with several other Microsoft Defender components. If you recall, Microsoft last year moved PowerShell updates to Microsoft Update (MU) as well since it provides automatic updates for Microsoft products and services. Thus the move is intended to allow Microsoft to deliver EDR improvements and security enhancements independently of the OS's regular monthly update cycle; this should enable faster deployment of protection updates without requiring organizations to wait for the next Patch release. For those unfamiliar, Microsoft Defender for Endpoint's EDR capabilities are designed to help organizations detect, investigate, and respond to advanced threats across managed devices. Keeping these components updated is critical for maintaining protection against evolving attack techniques. The rollout has already began for Windows 10 devices in late May 2026 (last month) and Microsoft says it will gradually expand support to Windows 11 and the remaining supported Windows versions over the coming months. The company expects deployment across Windows 10 and Windows 11 to be completed by fall 2026 or around Q3 of this year. Once the transition is complete, EDR updates will be delivered through Microsoft Update using KB5005292, provided the required prerequisite updates have already been installed. Microsoft is also introducing a new Defender Update Service as part of the change. Following installation of the first update, devices will automatically create a new directory located at %ProgramData%\Microsoft\Microsoft Defender\Defender Update. Microsoft notes that restarts may occasionally be necessary in case of "rare" failure scenarios. For most organizations, the tech giant says no action will be required as long as Microsoft Update is already permitted within their update management strategy. Admins who rely on manually deployed update packages, however, will need to adjust their processes to ensure the new Defender update package is included. Microsoft also recommends reviewing internal documentation and notifying helpdesk and security operations teams about the updated delivery mechanism to avoid confusion during the transition. As a prerequisite, the tech giant notes that systems must be running Sense version 10.8798.25857.1000 or later and have one of the following Windows updates (or later) installed: Win11 24H2 KB5062660 (2025-07 Cumulative Update Preview) Win11 23H2 KB5062663 (2025-07 Cumulative Update Preview) Win11 22H2 KB5062663 (2025-07 Cumulative Update Preview) Win10 22H2 KB5062649 (2025-07 Cumulative Update Preview) Win10 1809 KB5063877 (2025-08 Cumulative Update) Server 2019 KB5063877 (2025-08 Cumulative Update) Server 2022 KB5063880 (2025-08 Cumulative Update) Server 2025 KB5063878 (2025-08 Cumulative Update) As always, organizations should verify that their update policies align with the new servicing approach before the broader rollout reaches all supported Windows platforms later this year. In case of major problems, the EDR update can be rolled back to the inbox version stored in %ProgramFiles%\\Windows Defender Advanced Threat Protection (ATP) using: MpCmdRun.exe -RevertMde -Product Edr -ToVersion Inbox For those who have access to the Microsoft 365 Admin Center portal, you can view the message here under ID MC1381119.
    • My problem with these smart scopes is that IMO they're not really an entrance into astronomy, they're an entrance to looking at pretty(ish) pictures that you take by essentially pressing a button and letting the scope do the work. I still maintain that getting some binoculars or a solid dedicated telescope (which doesn't have to mean expensive) and actually spending time learning the night sky and using a telescope is a much better way to actually learn. But, granted, the learning curve is a bit steeper (as it tends to be).
    • I never played Crazy Taxi before, but this looks like a fun game!
    • The perception of Microsoft Edge is unfortunate as it's a world class browser. I'm not exaggerating when I say it's the best browser I've ever used with its feature set, speed, reliability and advanced security. Running the extension Ublock Origin is a must though to deal with advertising and trackers.
    • You can check out our latest 9070 GRE review with scores for the XT too, but these are on Windows: Gaming performance Productivity performance
  • Recent Achievements

    • One Month Later
      DJC50PLUS earned a badge
      One Month Later
    • Week One Done
      DJC50PLUS earned a badge
      Week One Done
    • Proficient
      Eric Biran went up a rank
      Proficient
    • Dedicated
      Conjor earned a badge
      Dedicated
    • Week One Done
      Windows Guy earned a badge
      Week One Done
  • Popular Contributors

    1. 1
      +primortal
      493
    2. 2
      PsYcHoKiLLa
      248
    3. 3
      Steven P.
      73
    4. 4
      +Edouard
      69
    5. 5
      neufuse
      68
  • Tell a friend

    Love Neowin? Tell a friend!