Showing posts with label Clarissa. Show all posts
Showing posts with label Clarissa. Show all posts

Sunday, November 30, 2008

Ultra HAL, your personal computer assistant


HAL the talking computer is here, just like in the famous Sci-Fi movie "2001: a Space Odyssey"! Well, ok, not REALLY, but at least this HAL won't lock you outside or kill you while you sleep! Ultra Hal is a fun HAL, and it's here NOW:

Ultra Hal Assistant 6.1 - NEW!
[...] Ultra Hal Assistant is your digital secretary and companion. He (or she depending on your character preference) can remember and remind you of appointments. He can keep an address book. He can keep a phone book, and even dial phone numbers for you! Hal can also run programs and recent documents on command. Hal can help you browse the Internet. He will offer you help with most of your Windows programs. Hal does all of this from natural language -- simply tell him or ask him something in plain English!

Hal has huge conversational database and can chat about anything at all. Hal will learn from every single sentence that you tell him and over time Hal will learn to like the same things you do, and to talk about topics you like to talk about. Ultra Hal Assistant even has built in speech recognition so that you can speak to Hal out loud instead of typing. Ultra Hal utilizes an advanced realtime 3D character engine from Haptek that delivers 3-D artificial human characters so convincing and engaging you could swear they were real. You can download the free trial version from this site. Find out whats new in version 6.1 [...]



Now I have played with the Ultra Hal Assistant a bit... calling it intelligent may be a bit of a stretch... though that may depend on what your definition of Artificial Intelligence is. Hal really does have the capacity to "learn" things from you, and with time and training, it does give seemingly intelligent, even surprisingly clever, replies and comments.

How useful it will actually be remains to be seen. I'm not much of a chit-chat person when talking with real people; chit-chatting with a computerized Artificial Intelligence can seem like rather a waste of time, once the novelty wears off. But giving it commands to look for stuff on Google for you, look up the weather and read it to you, dial phone calls and such - are things some people may find useful. And as Hal learns from you, the chit-chat gets more interesting.

You can use the free trial version, but to buy it costs only $29.95. It can be used as the "brain" for other 3rd party software products. I have to say that, what's probably more impressive than what Ultra Hal does, is the potential for what it can become. It's a technology in it's childhood, about to grow up.

Remember that Arnold Schwarzenegger movie, The 6th Day? There is one scene where the Arnold character is at the home of a divorced friend. The friend has a female talk-bot on the screen, and they flirt with each other. The Ulta Hal program can work with a variety of 3rd party "characters" that function in much the same way. Check out this on-line, interactive, talking People Putty Demo by Haptek. There are even various plug-ins for Ultra Hal, including alternate voices, brains, speech engines, and animated faces and characters, even flirty sex-bot characters, precursors to the one in the movie. The future is closer than you think.

There are also third-party companies that make realistic "voices" that are compatible with Ultra Hal. One such company is called Cepstral, that makes really good sounding computer voices. You can download free samples to try out, they come with an easy to use text reader software program: www.cepstral.com.

The combination of these products will provide you with a near-HAL 2001-a-space-oddity type experience. It's pretty kewl... just don't plug it into your cryogenic sleep chamber when you got to bed at night. ;)

A HAL type project was implemented by NASA on the International Space Station in 2005:

Clarissa: a HAL type computer for the ISS?

Clarissa is capable of understanding multiple voices of astronauts, recognizing when astronauts are talking to each other and not to it, and can deal with ambient noises, etc. While efficient in those areas, she does not try to make conversation like HAL does.

I'd like to see Clarissa's voice comprehension used in Ultra Hal. Probably the best voice recognition software available to most of us on planet Earth would be Dragon Naturally Speaking 10, which is I believe, what Clarissa uses. The reviews I've read seem to indicate it's the most accurate commercial voice recognition software commonly available. DNS doesn't make idle chit chat with you like Ultra Hal, but it can be used for speaking commands to your computer, as well as taking dictation.

Older version's of Dragon's speech engine have been compatible with Ultra Hal, but it's unclear to me whether or not the latest speech engine in Dragon Naturally Speaking can also be used with it. I've read conflicting reports, some say yes, some say there is restrictions that prevent it.

I've found that Ultra HAL had some trouble understanding my voice, because it would hear it's own voice speaking in reply and think it was me talking. I eventually gave up trying to teach HAL to understand my speech, meaning I had to use the keyboard to communicate with HAL. It would be nice if Clarissa's technology could be added to HAL's, making it more of a hands-free experience. As it was, having to keyboard my responses to HAL was not very productive for me, because I could not do other things while I was communicating with HAL. I would love to chit-chat with HAL and teach it things, IF my hands were free so I could get on with other things I need to do. If I could verify that the Dragon 10 engine would work with HAL, I'd buy it. Being able to chat with HAL hands free, would be worth it.

HAL does have a "Brain" settings panel, where you can set it's learning parameters. You can actually deliberately teach it things, and see them reflected back to you in it's responses. Sometimes the replies it can give you really do seem intelligent and even witty. The Ultra HAL program won the 2007 Loebner Prize for artificial intelligence most able to pass for a human.

An on-line talkbot called Elbot won the prize in 2008. It's an excellent on-line talk-bot, but not available for download onto your computer like HAL is. HAL is the only one I know of that you can download and use on your PC.

If you would like to try to converse with an on-line version of HAL, you can do so here:

Chat with Web HAL

I think all of these things, artificial brains, faces and voices, are combining to become one of those Next Big Things. As these technologies improve, I think it's going to take off and be very BIG. If you want a taste of things to come, download the free trial version of Ultra Hal and play with it. I've had great fun with it.

Zabaware has a page listing bloggers who have reviewed Ultra Hal:

Read what bloggers are saying about Ultra Hal,
an artificial intelligence chatter bot



UPDATE 12-11-08: I received an email dated December 2nd, from Zabaware. Here is an excerpt:

[...] Zabaware is hard at work developing the next version of Ultra Hal Assistant, which will be available in 2009. The new version will be a free upgrade to all Ultra Hal Assistant 6.x users and will include brain improvements and a new 3D graphics engine. Also in 2009 Zabaware will introduce cutting edge speech recognition and microphone technology, which will let you talk with Hal naturally without a clumsy headset microphone. Be sure to check www.zabaware.com in early 2009 for further news.

Sincerely,
Robert Medeksza

Mr. Medeksza is the creator of Ultra Hal. This is excellent news! I'll be looking forward to the upgrade and the improved speech recognition.


UPDATE 01-31-09:
There is a new Release Candidate, Ultra Hal 6.2, available for download on the Hal Forum. I've tried it out, and it's really good. Hal no longer hears his own voice when he speaks, and the overall voice recognition seems greatly improved. There are other improvements as well, I will post more about it when the final release out. For those who can't wait, check it out.

UPDATE 02-09-09:
The new release, Hal 6.2, is officially out. Free trial download available, free upgrade for current owners. I bought the ViaVoice plugin, and so far, it's very nice. I will do a post about this new version after I have used it for a while.

Get yours' here: Ultra Hal Assistant 6.2 - NEW Version!
     

Monday, March 24, 2008

Clarissa: a HAL type computer for the ISS?

Much of the technology portrayed in the famous science fiction film "2001: A Space Odyssey" has failed to materialize by the year 2001. The HAL 9000 computer in the movie is no exception.


Artificial intelligence (AI) is a controversial topic, with a lot of disagreement as to what actually constitutes real intelligence. Many argue that a computer like HAL is way off in the future, while others would maintain that it's closer than we think.

Whichever opinion one holds, it's clear that the science of AI is moving forward anyway. Today we may even be seeing the beginnings of what could one day lead to a HAL like computer. In fact, perhaps we already have HAL's great great grandmother! In 2005, the International Space Station got a talking computer called Clarissa to help the astronauts by reading instruction manuals to them. Maggie McKee explains it to us in this article from New Scientist:

Space station gets HAL-like computer      [published June 2005]

A voice-operated computer assistant is set to be used in space for the first time on Monday – its operators hope it proves more reliable than "HAL", the treacherous speaking computer in the movie 2001: A Space Odyssey.

Called Clarissa, the program will initially talk astronauts on the International Space Station through tests of onboard water supplies. But its developers hope it will eventually be used for all computer-related work on the station.

Clarissa was designed with input from astronauts. They said it was difficult to perform the 12,000 procedures necessary to maintain the ISS and conduct scientific experiments while simultaneously reading through lengthy instruction manuals.

"Just try to analyze a water sample while scrolling through pages of a procedure manual displayed on a computer monitor while you and the computer both float in microgravity," says US astronaut Michael Fincke, who spent six months on the station in 2004.

Clarissa queries astronauts about the details of what they need to accomplish in a particular procedure, then reads through step-by-step instructions. Astronauts control the program using simple commands like "next" or more complicated phrases, such as "set challenge verify mode on steps three through fourteen".

Kim Farrell, Clarissa project manager, simulates on-orbit use of the system in the International Space Station mock-up at Ames Research Center.

"The idea was to have a system that would read steps to them under their control, so they could keep their hands and eyes on whatever task they were doing," says Beth Ann Hockey, a computer scientist who leads the project at NASA's Ames Research Center in Moffett Field, California, US.

That capability "will be like having another crew member aboard", says Fincke. (You can see Clarissa in action in a mp4 video hosted on this NASA page.) [...]

Clarissa uses an "open mic", and is capable of understanding multiple voices of astronauts, recognizing when astronauts are talking to each other and not to it, can deal with some ambient noise, and has a high voice recognition rate of around 94%, making it a very useful and professional tool. You can read the full article for more details, and there's more videos of Clarissa on NASA's web site:

Clarissa NASA page with photos and videos

Clarissa is cutting-edge technology, and is leading the way for future voice recognition and text-to-speech applications closer to home.

Beth Ann Hockey is the project leader of the Clarissa project.
The Clarissa software program also borrows her voice.


I find the Clarissa project interesting not only for what it does now, but for what it has the potential to do in the future. The following is an excerpt from an interview with the project's leader, Beth Ann Hockey, who gives us some insight into where this is going:

WHO'S WHO AT NASA: Beth Ann Hockey

[...] NTB: How will NASA utilize Clarissa?

Hockey: It could be used widely in any area of NASA that uses procedures like these; however, spoken-language and spoken-dialogue technologies are much more general than that and can be used in all sorts of other places. For example, we had some conversations about using it for ground-maintenance crews and for developing applications for use in mission control. Any time you want to have your hands and eyes free, it will be a win. There are many times that it could be beneficial simply because you’re moving around. If you had wireless technology, plus the spoken-dialogue technology, you could move around and still be accessing information that you need.

NTB: How did Xerox contribute to this project?

Hockey: In the realistic-experimental version that we have, we worked on some technology with Xerox because one of the big ideas behind this was to have your hands and eyes free; we did not want the user to have to push a button to indicate that speech recognition should start, which is the way that some systems are designed. We needed to have the speech recognition running constantly. The system has to decide whether the speech that it’s hearing is directed at it – is it a command it should understand – or is it something it should ignore.

We got together with Jean-Michel Renders from Xerox Research Centre Europe, an expert on kernel methods, and we believed that those methods would do a better job on this problem. We worked with Renders on using the kernel methods to make this open-microphone decision, and we cut the error rate in half.

NTB: What are possible commercial applications for Clarissa?

Hockey: I just gave a talk at the V-World Summit, which is held by Nuance Communications for their developers and customers. I was invited because they see what we’re doing as the next-generation of applications in their area. Nuance is the speech-recognition engine that we use. We build the language understanding in addition to that engine. Nuance is the first stage in what we use; it takes your acoustic signal and makes a good guess at the words that signal might have been. Nuance’s main business is supporting telephone-bank-type applications. For example, if you call an airline to check flight information or if you have an automated banking application that you interact with, those are probably built with Nuance. These are the types of applications that now are commercially common.

The application that we did for the astronauts is more complicated in a lot of ways when compared to those systems, which feel like a “menu only” that you’re talking to. Our system feels like you’re having a conversation with somebody who may not be the brightest person, but it feels more like a conversation. It’s natural, as there are more of these kinds of menu-type commercial applications out there and people get used to them, to move toward a more conversational technology. This is true especially as the technology keeps maturing.

Aside from the menu-type uses for this technology, the navigating of procedures applications could be natural for doing any kind of equipment maintenance (i.e., airlines). For example, tasks in which you’d have to have your hands doing something while you’re laying underneath a piece of equipment and it’s not convenient to stop and scroll through a computer screen or flip though papers. So there already are plenty of commercial applications; we’re just carrying it to the next level.

I’ve been talking mostly about this procedure navigator, while in fact the component technologies in that are even more widely applicable. In particular, the other project on which I am the lead is called Regulus. We’re developing an open-source tool kit to try and make the creation of spoken-dialogue interfaces more accessible to regular developers. Currently, you have to have someone with expertise in language technology to be able to do this well, but we’re trying to make it so that people can take this toolkit and make their own simple-to-moderate interfaces. It’s open source – people can simply download it. We also are working on a book that will include tutorial materials on how to use that system, which should be coming out next year. If people are interested in that, they should contact us. [...]

I did a post earlier about Dragon NaturallySpeaking 9 voice recognition software, which uses the Nuance speech engine Ms. Hockey speaks of. The Nuance engine is impressive, and judging from the consumer reviews, it's regarded as the best voice recognition speech engine available. A close runner up is the Microsoft speech engine that's bundled with Windows Vista, which consumers say is nearly as accurate as Nuance's latest version.

Have you noticed the Microsoft TV commercials lately, regarding software driven by voice commands? Voice recognition and Text-To-Speech (TTS) technologies promise to be two of the Next Big Things in computer technology.

Combine it with Artificial Intelligence, and we are on our way to a HAL like computer somewhere in our future.