Jump to content

Why is search setup to fail with full-text stop words?


SJ77

Recommended Posts

Posted

Any word that is defined as a full-text stop word in MySQL I was told will fail in search. This is a pretty big list of common words. My users have searched for a few of them and found search to be of poor quality. They don't accept , "well that's a full-text stop word, so you can't search for it".  They simply assume "search doesn't seem to work"  (to quote some feedback from my users)

Surely there is a better way to do this. I know other search's for competing frameworks can search these words.

IS there a way to  get around these limitations?

EDIT: I was told by the dev team that all of these words would fail. Yet, some of them seem to work here. Can someone explain @Ryan Ashbrook I am so confused.

 

Words that (supposedly) can't  be searched

a's able about above according
accordingly across actually after afterwards
again against ain't all allow
allows almost alone along already
also although always am among
amongst an and another any
anybody anyhow anyone anything anyway
anyways anywhere apart appear appreciate
appropriate are aren't around as
aside ask asking associated at
available away awfully be became
because become becomes becoming been
before beforehand behind being believe
below beside besides best better
between beyond both brief but
by c'mon c's came can
can't cannot cant cause causes
certain certainly changes clearly co
com come comes concerning consequently
consider considering contain containing contains
corresponding could couldn't course currently
definitely described despite did didn't
different do does doesn't doing
don't done down downwards during
each edu eg eight either
else elsewhere enough entirely especially
et etc even ever every
everybody everyone everything everywhere ex
exactly example except far few
fifth first five followed following
follows for former formerly forth
four from further furthermore get
gets getting given gives go
goes going gone got gotten
greetings had hadn't happens hardly
has hasn't have haven't having
he he's hello help hence
her here here's hereafter hereby
herein hereupon hers herself hi
him himself his hither hopefully
how howbeit however i'd i'll
i'm i've ie if ignored
immediate in inasmuch inc indeed
indicate indicated indicates inner insofar
instead into inward is isn't
it it'd it'll it's its
itself just keep keeps kept
know known knows last lately
later latter latterly least less
lest let let's like liked
likely little look looking looks
ltd mainly many may maybe
me mean meanwhile merely might
more moreover most mostly much
must my myself name namely
nd near nearly necessary need
needs neither never nevertheless new
next nine no nobody non
none noone nor normally not
nothing novel now nowhere obviously
of off often oh ok
okay old on once one
ones only onto or other
others otherwise ought our ours
ourselves out outside over overall
own particular particularly per perhaps
placed please plus possible presumably
probably provides que quite qv
rather rd re really reasonably
regarding regardless regards relatively respectively
right said same saw say
saying says second secondly see
seeing seem seemed seeming seems
seen self selves sensible sent
serious seriously seven several shall
she should shouldn't since six
so some somebody somehow someone
something sometime sometimes somewhat somewhere
soon sorry specified specify specifying
still sub such sup sure
t's take taken tell tends
th than thank thanks thanx
that that's thats the their
theirs them themselves then thence
there there's thereafter thereby therefore
therein theres thereupon these they
they'd they'll they're they've think
third this thorough thoroughly those
though three through throughout thru
thus to together too took
toward towards tried tries truly
try trying twice two un
under unfortunately unless unlikely until
unto up upon us use
used useful uses using usually
value various very via viz
vs want wants was wasn't
way we we'd we'll we're
we've welcome well went were
weren't what what's whatever when
whence whenever where where's whereafter
whereas whereby wherein whereupon wherever
whether which while whither who
who's whoever whole whom whose
why will willing wish with
within without won't wonder would
wouldn't yes yet you you'd
you'll you're you've your yours
yourself yourselves zero    
Posted
1 hour ago, superj707 said:

IS there any reason to not do this? like what's the downside of making all these words not stop fullsearch?

It is a default list of words that are so common as to be arguably 'useless' in a search term, as the majority of rows in the database would be likely to be a result.
I would encourage you to note that even making this list much smaller, many of the words may still return no result if using MyISAM  due to the limitation listed here: http://dev.mysql.com/doc/refman/5.7/en/fulltext-natural-language.html

Posted
1 minute ago, Marcher Technologies said:

It is a default list of words that are so common as to be arguably 'useless' in a search term, as the majority of rows in the database would be likely to be a result.
I would encourage you to note that even making this list much smaller, many of the words may still return no result if using MyISAM  due to the limitation listed here: http://dev.mysql.com/doc/refman/5.7/en/fulltext-natural-language.html

I feel like it makes more sense to see MANY results as opposed to no results. If I am a user and I get MANY results I think "oh, I need to make my search more specific". However, if I am a user and I get no results I think "oh, search here is broken"

Posted
Just now, superj707 said:

I feel like it makes more sense to see MANY results as opposed to no results. If I am a user and I get MANY results I think "oh, I need to make my search more specific". However, if I am a user and I get no results I think "oh, search here is broken"

Actually not arguing that, just providing information, as that stopwords list changed may not be enough due to that MyISAM limitation.

Posted
1 minute ago, Marcher Technologies said:

Actually not arguing that, just providing information, as that stopwords list changed may not be enough due to that MyISAM limitation.

I know, I am just adding some follow up thought to the idea.

Posted
On 2/2/2016 at 5:10 PM, superj707 said:

I feel like it makes more sense to see MANY results as opposed to no results. If I am a user and I get MANY results I think "oh, I need to make my search more specific". However, if I am a user and I get no results I think "oh, search here is broken"

That's one of the reasons I hated seeing sphinx eliminated without a replacement already in place.  Kind of like going to the back door and throwing out the wash water and going "Oops... threw the baby out with the water". :rofl:

Posted

Another thing that REALLY gets me down is the inability to search in custom fields. It feels like it eliminates 50% of the point of even having custom fields. I've looked into modifying the existing search engine to include custom fields, but it seems like a really difficult task. I've also asked IPS if they plan to improve this, and they said they don't know.

  • 2 months later...
Posted

Im in the same boat here, one thing that I don't like with IPS is the search function. I seem to get a lot of zero results.

1. My understanding is that it is a limitation of mysql and that if i want i can change it to search for 2 letter or 3 letter words?

2. Also it is my understanding i can reduce the stop words?

My site is hosted with IPS how do i do these 2 items?

Archived

This topic is now archived and is closed to further replies.

  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...