commonsense team mailing list archive

[Bug 524508] [NEW] Show top concepts for canonical docs

To: commonsense@xxxxxxxxxxxxxxxxxxx
From: Rob Speer <rspeer@xxxxxxx>
Date: Fri, 19 Feb 2010 16:05:01 -0000
Reply-to: Bug 524508 <524508@xxxxxxxxxxxxxxxxxx>
Sender: bounces@xxxxxxxxxxxxx

Public bug reported:

When a canonical document is selected, the info pane should show a list
of commonly occurring concepts in the documents that are most similar to
it.

These should probably be pre-SVD counts, not post-SVD similarity scores.
For example, counting the number of times the words "chinese" and "thai"
occur, weighted by which documents they are in, but not weighted by
anything involving the "chinese" and "thai" concept vectors themselves.
This would reassure users that the data reflects reality, even if the
SVD comes out kind of weird.

Probably the best way to report these values would be as percentages --
this is what Dentsu does in their existing reports.
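
A rough sketch of the counting described above, in Python. This is only
an illustration under stated assumptions, not Luminoso's actual code:
the function name top_concepts, the doc_terms and similarities inputs,
and the choice to express each percentage as a share of the total
weighted count are all hypothetical. The weights come from how similar
each document is to the selected canonical document; the concept
vectors themselves are never consulted.

```python
from collections import Counter


def top_concepts(doc_terms, similarities, n=10):
    """Rank concepts by raw (pre-SVD) counts weighted per document.

    doc_terms:    hypothetical dict of document id -> list of concept strings
    similarities: hypothetical dict of document id -> similarity of that
                  document to the selected canonical document
    Returns a list of (concept, percentage) pairs, largest first.
    """
    weighted = Counter()
    for doc_id, terms in doc_terms.items():
        # Assumption: negative or missing similarities contribute nothing.
        weight = max(similarities.get(doc_id, 0.0), 0.0)
        if weight == 0.0:
            continue
        # Raw occurrence counts, scaled only by the document's weight.
        for term, count in Counter(terms).items():
            weighted[term] += weight * count

    total = sum(weighted.values())
    if total == 0:
        return []
    # Assumption: a percentage is the concept's share of all weighted counts.
    return [(term, 100.0 * value / total)
            for term, value in weighted.most_common(n)]


if __name__ == "__main__":
    docs = {
        "a": ["chinese", "thai", "chinese"],
        "b": ["thai"],
        "c": ["pizza"],
    }
    sims = {"a": 1.0, "b": 0.5, "c": 0.0}
    for concept, pct in top_concepts(docs, sims):
        print(f"{concept}: {pct:.1f}%")
```

Because only raw counts and document weights enter the calculation, an
odd-looking SVD would change which documents get weight but not how the
words inside them are counted.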

** Affects: luminoso
Importance: Medium
Status: Confirmed

** Tags: feature

** Changed in: luminoso
Importance: Undecided => Medium

** Changed in: luminoso
Status: New => Confirmed

** Changed in: luminoso
Milestone: None => 1.1

--
Show top concepts for canonical docs
https://bugs.launchpad.net/bugs/524508
You received this bug notification because you are a member of
Commonsense Computing, which is the registrant for Luminoso.

Status in Luminoso: Confirmed