Computers

This is way nerdy stuff - significant way nerdy stuff

This is way nerdy stuff - significant way nerdy stuff
View 1 Image
1/1

November 8, 2004 Apple's forthcoming Tiger operating system offers a compelling advantage. The following are Apple's words, but we concur and we think the advent of this system at this point in time is a landmark. Tiger will enable unmitigated access to meta-data for all. This access will enable the development of data-mining tools that will enable the world to mine its digital assets more effectively. If you were waiting for Apple to follow up the iPod, a product that established its presence in the entertainment industry, then this is it. This is devilishly clever! Meta-data is, succinctly stated, data about data. It provides a description about the data contained in a file such as (but not limited to) its height, width, size, creator, copyright holder, title, editor, created date, and last modification date. In practice, there is so much data that can be considered meta-data that these descriptions usually are generalized as a dictionary of descriptive attributes indexed by keys. You can think of it as the "what, when, and who" of a piece of data.

This access will enable the development of data-mining tools that will enable the world to mine its digital assets more effectively.

If you were waiting for Apple to follow up the iPod, a product that established its presence in the entertainment industry, then this is it. This is devilishly clever!

From Apple's Developer Forum: Spotlight marks a watershed in operating system history. For years people have been talking about making the file system as quick and easy to search as the web as well as using meta-data to make those searches more accurate. For years it's been all talk. Other operating systems have long promised it. Third party add-ons are starting to appear that provide it. When Tiger ships, however, it will be the first industrial strength operating system to feature a fully integrated, fast, and efficient search across all of the files on a system.

Make no mistake about it, Spotlight isn't "bolted on" to the system. It's a completely new search technology that is tightly integrated with a fundamental part of the OS: The file system. Every time a file is created, saved, moved, copied, or deleted, the file system automatically ensures that the file is properly indexed, catalogued, and ready for whatever search query might be issued-all in the background. These abilities build on the already impressive capabilities of the journaled HFS+ file system.

Not only is Spotlight available to end users, but the array of search technologies that make up Spotlight are also available to developers. This means that you'll be able to tap into these powerful search technologies to find files to display, plug-ins to load, and data to mine in your applications. No restrictions. No limits.

The technologies that power Spotlight are:

A database consisting of a high-performance meta-data store and content index that is fully integrated into the file system.

Programmatic APIs that are part of the CoreServices and Cocoa frameworks that let you query the meta-data store and content index.

A set of importer plug-ins that are used to populate the meta-data store and content index with information about the files on the file system.

A plug-in API allowing you to provide meta-data and content to be indexed for your application's custom file formats.

But more than a collection of individual technologies that work together, Spotlight gives you the ability to plug your application into the operating system and work with files in a totally new way. For example, if you were building an asset management application you could use Spotlight to find all of the files that match certain criteria rather than trying to slog through the file system yourself. Or, if your application specialized in supporting various kinds of workflows, you could use Spotlight to find all of the files that needed to be marked with a particular keyword. Once you get used to working with files in this new way, you'll never want to go back.

This article shows you how Spotlight works, how to programatically query the Spotlight Store, and how to create your own file format importers. As you can see, there is quite a bit of ground to cover. First, however, let's start out by defining what meta-data is.

What is Meta-data?

Meta-data is, succinctly stated, data about data. It provides a description about the data contained in a file such as (but not limited to) its height, width, size, creator, copyright holder, title, editor, created date, and last modification date. In practice, there is so much data that can be considered meta-data that these descriptions usually are generalized as a dictionary of descriptive attributes indexed by keys. You can think of it as the "what, when, and who" of a piece of data.

Some kinds of meta-data, such as file modification dates, ownership, and access permissions are kept external to the file by the file system and have been accessible via a variety of mechanisms. But the most interesting kinds of meta-data are found inside the file. For example, digital cameras embed all sorts of data, such as exposure information and whether a flash was used, into the image files that they produce. As well, files written by most applications, including Adobe Photoshop and Microsoft Word, contain quite a bit of meta-data.

Until now, this data has been buried in individual files, which has made it hard to work with and to search against. Spotlight gathers all of this information into the Spotlight Store allowing for quick, easy, and effective searches.

No comments
0 comments
There are no comments. Be the first!