LWN.net Logo

Social Semantic Sense for the Desktop (MIT Technology Review)

MIT Technology Review takes a look at the NEPOMUK Project. "People naturally group information by topic and remember relationships between important things, like a person and the company where she works. But enabling computers to grasp these same concepts has been the subject of long-standing research. Recently, this has focused on the Semantic Web, but a European endeavor called the Nepomuk Project will soon see the effort take new steps onto the PC in the form of a "semantic desktop."" (Found in KDE.News)
(Log in to post comments)

Semantics nitpick ...

Posted Dec 31, 2008 19:42 UTC (Wed) by AnswerGuy (guest, #1256) [Link]

... people group information by topic and remember relationships *among* them.

Between is only appropriate in a context involving two objects or concepts. If there are more than two of these involved the appropriate term is "among."

(It was a nitpick noted in my first book manuscript something I used to be sloppy about myself. Since then it's become a pet peeve).

--
JimD

Semantics nitpick ...

Posted Jan 1, 2009 14:28 UTC (Thu) by roblucid (subscriber, #48964) [Link]

Not nit-picking seems a lot clearer. European non-english speakers will probably find "between" much less confusing than "among" even if it is more grammatically correct.

Being understood is what matters most.

Semantics nitpick ...

Posted Jan 1, 2009 18:48 UTC (Thu) by bfields (subscriber, #19510) [Link]

Hmph. Consulting the OED's entry for between:

In all senses, between has been, from its earliest appearance, extended to more than two. In OE. and ME. it was so extended in sense 1, in which AMONG is now considered better. It is still the only word available to express the relation of a thing to many surrounding things severally and individually, among expressing a relation to them collectively and vaguely: we should not say ‘the space lying among the three points,’ or ‘a treaty among three powers,’ or ‘the choice lies among the three candidates in the select list,’ or ‘to insert a needle among the closed petals of a flower.’

I'm not sure which side the above would argue for in the current case. My ear wants "between". In any case, it's somewhat more complicated than a simple 2 vs. more rule. (And the "them" here could be read to always refer to pairs anyway.)

Semantics nitpick ...

Posted Jan 2, 2009 7:32 UTC (Fri) by rsidd (subscriber, #2582) [Link]

Merriam Webster says the same, and gives several more examples of where "between" is appropriate for multiple objects. So there it is from both sides of the pond.

Which is appropriate here? My opinion is: if it's not obvious, don't nitpick. Perhaps both are appropriate.

Semantics nitpick ...

Posted Jan 2, 2009 7:37 UTC (Fri) by rsidd (subscriber, #2582) [Link]

PS - in this case, the full sentence being nitpicked is in fact "People naturally group information by topic and remember relationships between important things, like a person and the company where she works." The example given is of two things, and it seems to me that the relationships being considered are in general binary relationships. (I can't think, off-hand, of non-binary relationships that cannot be decomposed into binary relationships.) So both premises of the nitpick were wrong.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 7:28 UTC (Fri) by petegn (guest, #847) [Link]

Yet another waste of your desktop resources , Already found and disabled just like the flaming dog (beagle) if i want something wasting time trawling thru my computers hard drive I will tell it to and this continued effort by people to install useless junk on your desktop needs to be stopped and quick else we WILL end up just like WindBloWs full of crap and wideass open say nay if you like but dont say you were not warned

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 9:17 UTC (Fri) by drag (subscriber, #31333) [Link]

huh. You must be using a different Linux system then I am. Lxde and tracker desktop search does a very good job of running on my older machines. I am happy to sacrifice the 1-3MB it takes to be able to locate files by their contents very quickly. Harddrives are just too slow to be waited on.

----------------------------

Just a month or two ago I set up a old embedded system with Linux. A SBC blade-style computer that plugged into a ISA bus along with any peripherals it needed. 133mhz Pentium with 32MB of RAM.

I used the latest and greatest kernel downloaded straight from kernel.org and then I used busybox for userland. Got the system configured, booted, and running from a single 1.44MB floppy drive. No patches required, no hacking any sources. Just configured and built from code downloaded straight from the respect project's stable releases.

-----------------------------

Linux, as always, and remains to this day, is what you make of it. If you want a bloated desktop, it's yours. If you want a slim system, it's yours. Just make it do what you want. Just requires some effort and knowledge.

Personally, I don't give a crap about how 'bloated' things get. As long as my system is fast, responsive, and doesn't spend much time in swap I am happy. I care much more about my ability to get things done quickly. If sacrificing 512MB or more of my RAM on my desktop on some feature that will save me a hour or so of wasted work a day fumbling around with my computer, then that would be a trade off I would happily make. If I don't see any benefit in it, on the other hand, I'll just turn it off and forget about it.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 9:19 UTC (Fri) by drag (subscriber, #31333) [Link]

"drive".. oops make that a single '1.44' disk. No separate 'root' and 'boot' for me.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 22:38 UTC (Fri) by sbergman27 (subscriber, #10767) [Link]

Tracker does a much better job than Beagle at staying out of the way. I administer XDMCP servers. I have one server running about 60 simultaneous business desktops, and when I started enabling tracker for users, I was prepared for it not being acceptable from a performance standpoint. But, in fact, we barely noticed any effect from the 60 trackerd processes. Performance is still excellent. And my users have the benefits it offers. I'm very impressed with tracker.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 8, 2009 22:30 UTC (Thu) by lysse (guest, #3190) [Link]

Don't feed the trolls, people!

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 9:53 UTC (Fri) by Janne (guest, #40891) [Link]

"Yet another waste of your desktop resources"

It consumes a fraction of your "resources". Initial indexing might take some time with these search-tools and the like but after that, it's smooth sailing.

And pray-tell: what would you rather do with your resources? 99% of the time, the CPU's in your computer run idle. You have more processing-power available to you that most of the time, you use only a fraction of it. Why not use that power for something useful instead? Do you need a 2+GHz quad-core CPU and gigabytes of RAM to type text to lwn.net? No you do not.

"if i want something wasting time...."

So, instead of letting your computer create an index of your files, you would rather waste your time manually searching for your files? Nice logic...

"I will tell it to and this continued effort by people to install useless junk"

I don't find things like these "useless". Quite the contrary. And if everyone thought like you did, we would still be stuck in the eighties, as far as computing is concerned.

Computers are supposed to remove the burden of manual hand-holding. And things like these are doing exactly that. Sure, the might consume some resources. But considering how much CPU-power, RAM and the like we have at our disposal, the negative impact they have on overall performance is minimal. And te potential for positive performance-impact is humungous. It's A LOT slower to manually search for files and content than letting the computer do it for you.

"WindBloWs"

How old are you? 12?

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 17:44 UTC (Fri) by petegn (guest, #847) [Link]

Probably considerably older than you if push came to shove and have been involved in the PC side of computing almost since day ONE .

Yes it is a waste of resources AFAIAK

and yes microSloPs produces windBloWs
and no i could not give a rats a** about your system
i find them an absolute total waste of time

each to there own
Linux systems come with the find utils and the locate DB that is plenty quick enough unless of course your are a speed freak also if you have a decent file system layout you know where your files are stored

:-) .


Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 2, 2009 20:52 UTC (Fri) by sebas (subscriber, #51660) [Link]

So being an old fart doesn't make you a smart, or even a considerate fart. :-)

I guess we all agree that the shoe that fits one person doesn't necessarily fit any others' person's feet, and that shouldn't be discussed. Nevertheless, I personally found your comments below the relatively high level of both, intelligence and respect I'm enjoying reading LWN for.

Nobody forces anyone to use a desktop indexing system (which is what you seem to be talking about). I somewhat see your point, until now, desktop search systems I've tried didn't cut it for me. (And I didn't even consider running it on 15-year-old hardware like you seem to be enjoying.) Mainly because of two points: resource consumption and lack of integration. (Basically they're too slow and too far away from what I've been doing.)

The point of the NEPOMUK technology though goes much, much further than indexing of files on the desktop. It's about semantic technology that can be used to organise users' data in a way that comes closer to how the human minds organises the data.

I see NEPOMUK very much like the next big thing, the enabler that sets us apart from proprietary technology. Implementing the semantic desktop, a desktop that's smarter and presents all the information you store on it in a way that's understandable and recognizable for human beings is one of the next big things that we're planning to do in KDE. And I think we have the foundations in the technology at our fingertips now. So let's have a look at how this could look like, and some use cases along with it.

We're started off with desktop indexing ("find me files containing 'fnuiken'"). We've added manual tagging and rating of files on top of that ("Show me all files belonging to project XyZ"). This is where we are now, pretty much, although the lack of user interfaces available for that make it all a bit hard to discover.

Next up is making applications provide meta information automatically (think of storing information of origin, usage and context ("You've downloaded this file from foobar.org, you last opened it when viewing holiday photos on 13 December"). Integrating information from ontologies would be a following step ("This photo shows your friend Max, click here to chat with him" as another cheesy example).

We're only just touching the surface of what a technology like NEPOMUK enables us to do. Dismissing it as waste of resource, while fine on a personal level, doesn't really do justice to its potential, despite its horrible name ;-)

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 3, 2009 10:32 UTC (Sat) by Janne (guest, #40891) [Link]

"Probably considerably older than you if push came to shove and have been involved in the PC side of computing almost since day ONE ."

Ooooh, I'm impressed. Or maybe not.

"and yes microSloPs produces windBloWs"

Like I said: 12 year old...

"also if you have a decent file system layout you know where your files are stored "

Manually locating files or folders (eiter with a filemanager or through CLI) is A LOT slower than typing few letters in a search-tool. Even if you have the worlds smartest filesystem-layout, you are still a lot slower at typing/mousing than your computer is at searching.

So try a better argument.

We are fast approaching (or we have reached it already) a point where you could have a whole core in your computer dedicated at running various housekeeping-tools (indexers, spellcheckers etc.) and it would have a neglible impact on overall performance.

What you want to do is to do everything yourself. But you fail to understand the fact that CPU-time is cheap. Very cheap. But the users time is NOT cheap.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 3, 2009 17:28 UTC (Sat) by Sutoka (guest, #43890) [Link]

>Linux systems come with the find utils and the locate DB that is plenty quick enough unless of course your are a speed freak also if you have a decent file system layout you know where your files are stored

I've never liked locate because it thrashes the disk every single time it updates the DB (destroying the disk cache, as well as making most all desktop applications slow as hell). Locate also sucks w.r.t. staying up to date (it doesn't, for only finding system files this is okay but sucks when trying to find your own data). rlocate avoids always being out of date, but doesn't solve the disk thrashing or only operating on filenames, as well as requiring you to run a daemon and kernel module.

Having a decent file system layout only helps when you have a small number of specific files. As you get more and more files, a hierarchy approaches uselessness for *fast* retrieval, often simply getting in the way.

Desktop search isn't meant to replace locate or filesystem layouts, it's meant to solve a COMPLETELY different issue, one that deals with an explosion of information that requires the entire system to cooperate to successfully alleviate.

IMO, NEPOMUK is the _very_ beginning of the road to a desktop search that doesn't suck. The ideas behind nepomuk is the answer, it'd be great if more work could be put into it (especially app support).

>i find them an absolute total waste of time
Right back at ya. They're inadequate for mine and many other peoples' needs. It's high time we advanced beyond hierarchical representations of information that are merely a vestige of the limitations of an analog world.

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 3, 2009 18:14 UTC (Sat) by dkite (guest, #4577) [Link]

A few years ago I was on irc, discussing search with some kde devs. I said
something to the effect that how different would it be from grep. Frankly,
I am not interested in a slow and disconnected version of grep.

But this isn't grep. The closest I've seen to a very useful search function
was on the palm os. Each application responded to a search call, returning
the information it considered appropriate. It wasn't fleshed out on that
platform, but had potential.

When the applications that know where the data came from and know the
structure of the data push it into nepomuk, then there is a possibility to
be very useful. Not only push, but pull connections and similarities or
groupings generated from other apps. Much more than grep.

And worth the resources it requires.

Derek

Social Semantic Sense for the Desktop (MIT Technology Review)

Posted Jan 8, 2009 22:35 UTC (Thu) by lysse (guest, #3190) [Link]

If you're this petegn:

http://www.bebo.com/Profile.jsp?MemberId=1562301261

then you're full of it.

Copyright © 2008, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds