semscapes

HD Maps: And Where Are The F*&*() Topics?

(continued from Part III)

Now that I know where the documents are located in the landscape, I have experimented with ways to estimate where the topic map topics are supposed to be. My hypothesis is that if I can determine the distance of each document to every topic, I can triangulate the topics.

Topic Positions

Below (larger version in the attachments) is a new rendering of the MapReduce theme:

http://kill.devc.at/system/files/x4-small.jpg

It shows the themes derived from the semantic corpus (documents + semantic network). Compare this with the positions of topics:

Posted In

Use Case: Semantic Document Recovery

(continued from HD Semantic Maps)

Like most of you, I collect bookmarks. But unlike most of you, I store them into a semantic network, a topic map to be precise.

One problem I certainly share with you, is that all these laboriously collected links are prone to break. To recover them sometimes needs considerable effort and - according to another Murphy Law (are there actually any other laws?) - always hits you at the most inappropriate time.

Posted In

High-Definition Semantic Maps (Part IV)

(continued from Part III)

Lately I invested more work in the backend server (TM::IP) to also host the document positions: Positions of those documents which - together with the underlying semantic network - form the landscape.

http://kill.devc.at/system/files/mapreduce-small.jpg

The theme is still MapReduce, but with considerable more content than before.

Seamless document access

On top of Seadragon I then implemented a bit of mouse hover logic to be able to preview HTML and PDF pages directly onto of the map.

**Play with it!**

But if you break it, you buy it.

Posted In

Graphics::DZI Developer Release

This weekend I managed to package up Graphics::DZI. It is a naive implementation of the DeepZoom mechanism as used in Seadragon.

The API is not completely stable; first I will have to integrate the piece into my semantic map generation infrastructure.

I also need to better understand how to deal with very sparse maps.

Posted In

High-Definition Semantic Maps (Part III)

(continued from Part II)

This week I have toyed around with ways to zoom in and out of the map. After some experimentation I ended up with Seadragon, a Microsoft acquisitioned technology.

http://kill.devc.at/system/files/mapzoom-small.jpg

Try it out yourself.

Posted In

High-Definition Semantic Maps (Part II)

(continued from Part I)

One of the questions you might rightfully ask, is how much impact the semantic network information within the topic map has on producing visualisations like those below:

http://kill.devc.at/system/files/mr-wo-docs-small.jpg

Or how much they should have, as this is a parameter which I must control.

Posted In

High-Definition Semantic Maps (Part I)

This is my first stab at a realistic data set (see the attachments for the original resolution):

http://kill.devc.at/system/files/mr-wo-docs-small.jpg

It shows the landscape around the theme MapReduce, a cloud computing technology about which semantic web people may or may not have heard.

Posted In

TM::IP RESTful in Peace

I had mentioned earlier that have now reorganized my new TM server (based on Catalyst/mod_perl/Apache) along the REST paradigm. In my case this means that not only TM data, but also documents attached to it, vector spaces, and so forth are exposed RESTfullish.

At first this appeared to be more RESTfoolish as it was quite difficult to squeeze everything into a GET/PUT/POST corset. And it also was much more work than I had planned to invest, mostly because not only the original resources, but also all machine learning processes have to be exposed, and if it is only their configuration parameters. And they have plenty.

But I seem to have reaped the benefits much earlier than anticipated. Read on.

Posted In

Announce: Graph::PetriNet on CPAN

One of the problems I have to solve for my infrastructure is to compute semantic landscapes ("SemScapes" if you were so marketing-ish inclined) with an efficient computation model.

If, for instance, a user has added a new document to the document corpus, then new feature vectors, after that new vector spaces, new convergence models, new landscapes and new maps (as images) have to be generated.

There is a dependency graph, quite similar to one you are used with tools like make.

Posted In

TM::IP RESTful map surfaces

One of the many pieces in my puzzle are surfaces of topic maps. When computed these are simply PNG files. There will be different resolutions of these surface (maps), depending on how much content is involved.

As I want to integrate this into my TM::IP landscape, the best is to follow the pattern I used with TM::IP::Documents and have another Catalyst controller doing the work.

Posted In

Topic Maps are Maps!

A while back I ranted that topic maps are normally not visualized as (quasi) geographical maps. I argued that the map metapher is so natural to most of us that lifting it into a semantic space is worth a try.

http://kill.devc.at/system/files/test.jpg

Posted In