Apple’s Roadmap: Voice-activated media management system for the iPod

Posted by Dennis Sellers Apple ico Jul 11, 2005 at 1:04am

Apple’s Patent Application 20050015254 reveals a new and exciting concept relating to a server-based media management system. The new system will provide the next generation of wireless iPods with ability to download additional updateable menus, beyond those that come prepackaged with the unit.

A secondary aspect of the media management system relates to a voice activated option that will add a level of convenience to those driving a vehicle (car, truck or boat), riding a bike, jogging and so forth.

The Media Management System

Figure 2 represents a block diagram illustrating one embodiment of a basic organizational structure of a media management system (200). The media management system is the computer program that will allow the user to both organize and access digital media over a network (LAN, WAN or the Internet).
image

A media management system comprises the following: a media database that stores media files; media collection records that include data relating to groupings of the media files; media records that include metadata relating to the media files; a voiced names database that stores audio files; and string association records that associate the audio files with data from the media collection records and metadata from the media records. This particular description was associated with a wireless iPod.

Key point to patent

One of the key points of this patent is that that the media management system isn’t limited to music as is revealed in this paragraph: It should be appreciated that any reference to “songs” or “music” could be generalized to any form of digital media, which can include sound files, picture data, movies, text files or any other types of media that can be digitally stored on a computer. Similarly, any reference to “playlists” can be generalized to media collections, including collections of mixed digital media.

As a side note, one of the devices introduced in this patent was found in patent point # 0024 which lists network-aware audio/video stereo components amongst other standard Mac clients such as a notebook, iMac and so forth, which could have a different version of the media management system. Interestingly Apple’s other iPod related application (20040224638) included the interaction of an iPod to a home theater system, which is exactly what this patent suggests.

Vague reference to a set top box

It wouldn’t be a stretch in the least to consider that one of the possible embodiments relating to this media management system could in fact be a set top box. I say that because patent points 0054/55 specifically present both Motorola and MIPS processors.

The MIPS architecture is a leader within the digital set-top box market. Those using the MIPS architecture for set top boxes today include Motorola, Echostar, Hughes Network Systems, TiVo, Sony, Pioneer and RCA.

Additionally, patent point 0055 goes on to discuss various interface cards such as HSSI interfaces, which are used in Satellite modems and set top boxes, DHEI I/O cards which are associated with broadband multimedia-service routers, and others like cable interfaces. So I can’t see any other reason as to why the patent would include the MIPS architecture, except for it being for a set top box or related device.

Voice Activated Menu

The patent states that the media management system will allow for updateable audio menus. Although a device might have some pre-packaged menu components, other menu components are received from a server. For example, the iPod might come with some pre-installed menu components (e.g., a top menu level of “Playlists,” “Songs,” “Artists,” “Settings,” and “About”) but allow other menu components to be added to the various menu choices (e.g., a user-added top level menu of “Genre” or the second level menu listing of available playlists, songs, and artists).

image

Each menu component, regardless of whether it is original or received from the server, has an associated voiced name. When the user highlights a menu choice, the voiced name is played. The user then has the option of selecting the menu choice or scrolling to a new menu choice. The patent suggests that alternative mechanisms are possible like a music studio embedding the vocalizations into the songs themselves thereby eliminating the need for a voiced names database.

The iPod as a universal remote

One of the key points of this patent as outlined above is that the media management system isn’t limited to music. So it was very interesting to read AppleInsider’s report on Apple’s patent 6,914,551 which covered a universal remote. This couldn’t be more fitting in context with this patent relating to a media management system using a wireless iPod!

Back in August 2004, I presented a patent report entitled “Apple’s Illuminating Future: Chameleon,” where I listed Steve Hotelling as one of Apple’s patent contributors. At that time I also presented the fact that Steve Hotelling was associated with a multi-disciplinary-industry project covering a Unified Universal Remote Console Standard. That standard was associated with next-generation network-centric concepts. One of the interesting features associated with that project was that it was to allow a user to carry around their universal remote with TV and other network-preferences that could be used on any hotel TV around the world that had next-generation broadband connections. The idea was that you’d be able to watch your local channels anywhere, anytime, on any television. Evidently Steve Hotelling’s experience on that project was instrumental in bringing ideas to this new Apple patent regarding a universal remote.

Possible future iPod eyewear

As far as voice activated menus options are concerned, the patent suggests that music studios will begin to include vocalization of titling right into their recordings at some point in the future. That convenience will then be available to new purchases of iTunes which will advance the wireless iPod’s media management system so that it could be used in our vehicles or while simply taking a walk.

image

One application that could apply here is that of a wireless iPod connected to Oakley’s RAZRWIRE Bluetooth Eyewear that Motorola is to unveil in the second half of this year. Apple could decide to work with Oakley directly or simply offer the Motorola version as a third party iPod accessory. Apple does list QUALCOMM’s PureVoice in relation to this aspect of the patent. So being able to request tunes via a microphone built into RAZRWIRE like eyewear, would allow cyclists and others to conveniently access tunes hands free. Now that’s cool.

Bottom line

All in all, Apple’s patent for a media management system presents some very promising ideas for advancing our digital lifestyle. The fact that the media management system could be utilized beyond just music and into areas such as television or controlling a stereo – is the key. And the news of Apple’s new universal remote patent really kicks things up a notch. The patent’s reference to the MIPS processor definitely suggests that the media management system could work in conjunction with set top boxes from various OEM’s, if not one from Apple themselves.

Here’s to hoping that someday soon we’ll be seeing Apple actually introducing some of these products instead of just reading about them in patents!

Other recent iPod related patent reports

“A notebook with integrated iPod mini”

“Apple’s Personal Mobile Radio, TV & Telephony”

Frank Petrie Says:

A voice activated iPod? Priceless.

Posted on July 11, 2005

Dave Says:

Well said Frank, priceless!

Posted on July 11, 2005

Cats Says:

Good one. A media management system is a key patent like you said, if it comes to market. The Mips and qualcomm additions are very interesting.

Off topic, I also see that AppleInsider is playing catch-up on the cup holder patent you covered last week. Not that I care about a cup holder, but nice to see that you covered first for Macsimum.

Posted on July 11, 2005

scottschor Says:

I don’t understand why voice actuation hasn’t become more mainstream after all these years.  I used to show-off for friends about how I could give voice commands to my (then) new Powermac 7100/66 ... and that was approx. 10 years ago.  What am I missing about the complexities of implementing voice actuation?  Thank you for all thoughts or insights.

Posted on July 13, 2005

Denny Says:

I think voice activation will have its breakthrough in vehicles.

Why hasn’t taken off is technical but perhaps there’s too many dialects/accents to deal with. How do you tell a customer that his system isn’t working because they can’t speak English properly. Yeah, that’ll really go over well. And then you have the “HAL” factor. Do you really want to converse with a computer :-)

HAL, play “Run like Hell” from Pink Floyd. Yes Denny, but why do feel the need to run? Shut up HAL and play the damn music :-)

Any thoughts of your own Scott?

Posted on July 16, 2005

scott schor Says:

I’m sure voice activation would work in autos, but why stop there?  If voice recognition can be reduced to a chip in an automobile, why not anywhere else?  TV or radio channel switchers, ovens, bank transactions, washers and dryers ... once a voice interpreter on a chip it should propagate everywhere.

Too many dialects?  I remember IBM’s ViaVoice training to receive my voice when I first got it.  iTMS started in the US, and then over the course of a great deal of time spread around the world to places of different languages and dialects, and it’s still spreading.  Perfecting dialects would be an ongoing maturation of the technology.

I’m still not sure I understand the technical difficulties inherent in implementing full scale voice services.  Certainly the phone company does it, my credit card companies do it, my healthcare provider does it ... and so on.

I’m not so concerned with Hal becoming my nemesis.  In fact, my computer is indeed one of my tools.  I want it to respond to my wishes with minimal effort on my part.  I ultimately believe voice commands are the simplest means of conveying my wishes.

Computer: get my e-mail, open salon.com and nytimes.com, check for updates.  Radio: Wake me up tomorrow morning at 5:00 AM.  Shower: Turn on tomorrow morning at 5:10 AM at 105 degrees.  Coffee maker ... blah blah blah ...

No, I don’t believe the reason for the absence of voice actuation is technical but, rather, economical.  If I knew the real reason, however, I would be incredibly affluent and luxuriating in the mountains of Tibet.

Posted on July 18, 2005

Commenting is not available in this weblog entry.

Article Information

Comment on this Article Print this Article Email this Article Digg This

Contributor

Contributor

Dennis Sellers

Dennis has been a newspaper editor/reporter (seven years) and teacher (seven years). He has over 4,000 magazine, newspaper and online articles to his credit.  He has also covered the Mac and tech industries for over a decade for such online publications as MacCentral, MacMinute and now MacsimumNews.

Recent Articles


Hotel München