Invision Community 4: SEO, prepare for v5 and dormant account notifications Matt November 11, 2024Nov 11
Posted February 16, 20241 yr Obviously, IC does not use full-text search but also lacks the basic stemming, synonyms, plurals in search. If we search for the word "consist" and there also exist words like consists, consistent, consistently, etc. these additional words are not matched in the search which does an exact word match search only making the search far from accurate. As you are aware, stemming works by using the derived variations of the word's root in the search. Searched word: manage Root: manag Stems: manage, manager, managers, manages, management, managerial, managing... Now we can use these root words with wild cards in search logic for a knock-out search. Plurals are covered by stemming (wildcard on root). https://www.roscripts.com/php_search_engine-119/ https://github.com/hugsbrugs/php-synonym https://www.google.com/search?q=download+thesaurus+database https://stackoverflow.com/questions/2475045/php-script-to-find-synonyms https://github.com/jmagnone/codeigniter-googlesearch-api https://www.hitbullseye.com/Vocab/List-of-Synonyms.php https://github.com/markfullmer/porter2 https://tartarus.org/martin/PorterStemmer https://tartarus.org/martin/PorterStemmer/php.txt https://www.javatpoint.com/stemming-words-using-python https://www.phpclasses.org/package/12888-PHP-PHP-extension-to-implement-the-Porter-stemmer.html https://www.geeksforgeeks.org/introduction-to-stemming/ https://pecl.php.net/package/stem https://en.wikipedia.org/wiki/Stemming Soundex - https://dev.mysql.com/doc/refman/8.0/en/string-functions.html#function_soundex (maybe implemented using an optional "Sounds Like" checkbox on the Search page) Please implement stemming and synonyms in search logic. Edited February 16, 20241 yr by WebCMS
February 16, 20241 yr You can try to implement Elastic search to get stemming. Edited February 16, 20241 yr by Sonya*
February 16, 20241 yr Community Expert As mentioned above, we do provide support for elastic search integration if you would like more advanced search functionality
February 16, 20241 yr Author I saw that already which is meant for large, high-traffic sites. I made the suggestion for smaller sites with some basic stemming and synonyms for a wider search coverage (vs exact word match search which is limiting and not user-friendly).
February 19, 20241 yr Author Why is Elasticsearch not included on CiC managed hosting? I understand the web, database, auth, APIs, load-balancing, media, caching, search, etc. can be hosted on separate servers but not including ElasticSearch on CiC and locating it on an external server does not make it managed. It requires self-hosting + maintenance that cost $$$ or paying subscription $100+ per month separately. Instead of each CiC client setting up EC separately, it could be setup on CiC. Those who wish to use it could just switch to it inside ACP. The current search without stemming, synonyms, plurals, etc. is limited and sub-optimal. Edited February 19, 20241 yr by WebCMS
February 19, 20241 yr This is something they've been working on previously based on other live streams, etc. It's not as simple as simply tossing up a random Opensearch/Elasticsearch instance when you talk about the size and scale that iPS operates at. In addition, there are BIG costs associated with an enterprise class platform... so it has to be done in a way that is economical for IPS as well otherwise that hosting cost could have a decent size bump!
February 20, 20241 yr Author PHP Stemming class + Thesaurus dB for synonyms => poor man's full text search for free 😀 Most sites are small-mid size and don't really need ElasticSearch Edited February 20, 20241 yr by WebCMS
February 24, 20241 yr Author If and when this is implemented, please also provide an option in the ACP to add our own niche synonyms for niche forums (in addition to thesaurus dB for synonyms).
September 14, 2024Sep 14 Author You can integrate Google Search on your site using Google's infra as an alternative search option for accurate and blazing-fast searches (including single-char searches) and offer its submenu under Activity menu. Google Search also offers custom synonyms, custom autocomplete, search-result augmentation from web, sorting, refinements, promotions, query enhancements, search settings, page restrictions, image search, safe search, sites to search, excluded sites, restricted regions, language selection, etc. and also offers APIs - https://www.telugus.com/search-google Edited September 14, 2024Sep 14 by WebCMS
September 14, 2024Sep 14 The sad part is that we can't use Elasticsearch's multilingual capabilities in IPS. Even your screenshot shows that we can only use one language analyzer, which is not suitable for multilingual websites 😢