commonsense team mailing list archive

[Bug 445125] Re: non-normalized concepts exist

 

In working on fixing this, I've stumbled across a suboptimal decision I
made a while ago. I wanted the concept "people" to not match its
normalized form, which MBLEM thinks is still "people", singular. So I
associated the SurfaceForm "people" with the concept "person" manually,
and I guess I assumed that we'd be using SurfaceForms to do
natural-language normalization. It seemed like a good way to override
special cases.

This was probably dumb. We want to be able to do normalization without
having the database at all. I'm going to try to convince MBLEM that
"people" is foremost the plural of "person".
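
To make the idea concrete, here is a minimal sketch of normalization that needs no database: special cases live in a plain in-code table that is consulted before the lemmatizer. Everything here is hypothetical. `IRREGULAR_FORMS`, `naive_lemma`, and `normalize` are illustrative names, not part of MBLEM or the ConceptNet API, and `naive_lemma` is only a crude stand-in for the real lemmatizer.

```python
# Hypothetical sketch: these names are NOT from MBLEM or ConceptNet.

# Special cases that should win over whatever the lemmatizer says.
IRREGULAR_FORMS = {
    "people": "person",
}


def naive_lemma(word):
    """Crude stand-in for a real lemmatizer: strips a single trailing 's'."""
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def normalize(text):
    """Lowercase the text and normalize it word by word, overrides first."""
    words = text.lower().split()
    return " ".join(
        IRREGULAR_FORMS[w] if w in IRREGULAR_FORMS else naive_lemma(w)
        for w in words
    )


print(normalize("people"))  # person
print(normalize("balls"))   # ball
```

Keeping the override table in code rather than in SurfaceForm rows means the same result comes out whether or not a database is available, which is the property wanted above.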

-- 
non-normalized concepts exist
https://bugs.launchpad.net/bugs/445125
You received this bug notification because you are a member of
Commonsense Computing, which is the registrant for ConceptNet.

Status in ConceptNet API: New

Bug description:
I noticed that some concepts seem to be not normalized:

>>> Concept.get('balls', 'en')
<Concept: <en: balls>>
>>> Concept.get('ball', 'en')
<Concept: <en: ball>>
>>> Concept.get('balls', 'en').surfaceform_set.all()[0]
<SurfaceForm: balls>
>>> Concept.get('balls', 'en').get_assertions().count()
45

Where'd that come from?


