WebCMS Posted February 16 Posted February 16 (edited) Obviously, IC does not use full-text search but also lacks the basic stemming, synonyms, plurals in search. If we search for the word "consist" and there also exist words like consists, consistent, consistently, etc. these additional words are not matched in the search which does an exact word match search only making the search far from accurate. As you are aware, stemming works by using the derived variations of the word's root in the search. Searched word: manage Root: manag Stems: manage, manager, managers, manages, management, managerial, managing... Now we can use these root words with wild cards in search logic for a knock-out search. Plurals are covered by stemming (wildcard on root). https://www.roscripts.com/php_search_engine-119/ https://github.com/hugsbrugs/php-synonym https://www.google.com/search?q=download+thesaurus+database https://stackoverflow.com/questions/2475045/php-script-to-find-synonyms https://github.com/jmagnone/codeigniter-googlesearch-api https://www.hitbullseye.com/Vocab/List-of-Synonyms.php https://github.com/markfullmer/porter2 https://tartarus.org/martin/PorterStemmer https://tartarus.org/martin/PorterStemmer/php.txt https://www.javatpoint.com/stemming-words-using-python https://www.phpclasses.org/package/12888-PHP-PHP-extension-to-implement-the-Porter-stemmer.html https://www.geeksforgeeks.org/introduction-to-stemming/ https://pecl.php.net/package/stem https://en.wikipedia.org/wiki/Stemming Soundex - https://dev.mysql.com/doc/refman/8.0/en/string-functions.html#function_soundex (maybe implemented using an optional "Sounds Like" checkbox on the Search page) Please implement stemming and synonyms in search logic. Edited February 16 by WebCMS Jimi Wikman 1
Sonya* Posted February 16 Posted February 16 (edited) You can try to implement Elastic search to get stemming. Edited February 16 by Sonya*
Marc Posted February 16 Posted February 16 As mentioned above, we do provide support for elastic search integration if you would like more advanced search functionality
WebCMS Posted February 16 Author Posted February 16 I saw that already which is meant for large, high-traffic sites. I made the suggestion for smaller sites with some basic stemming and synonyms for a wider search coverage (vs exact word match search which is limiting and not user-friendly).
WebCMS Posted February 19 Author Posted February 19 (edited) Why is Elasticsearch not included on CiC managed hosting? I understand the web, database, auth, APIs, load-balancing, media, caching, search, etc. can be hosted on separate servers but not including ElasticSearch on CiC and locating it on an external server does not make it managed. It requires self-hosting + maintenance that cost $$$ or paying subscription $100+ per month separately. Instead of each CiC client setting up EC separately, it could be setup on CiC. Those who wish to use it could just switch to it inside ACP. The current search without stemming, synonyms, plurals, etc. is limited and sub-optimal. Edited February 19 by WebCMS
Randy Calvert Posted February 19 Posted February 19 This is something they've been working on previously based on other live streams, etc. It's not as simple as simply tossing up a random Opensearch/Elasticsearch instance when you talk about the size and scale that iPS operates at. In addition, there are BIG costs associated with an enterprise class platform... so it has to be done in a way that is economical for IPS as well otherwise that hosting cost could have a decent size bump! WebCMS and Jim M 1 1
WebCMS Posted February 20 Author Posted February 20 (edited) PHP Stemming class + Thesaurus dB for synonyms => poor man's full text search for free 😀 Most sites are small-mid size and don't really need ElasticSearch Edited February 20 by WebCMS
WebCMS Posted February 24 Author Posted February 24 If and when this is implemented, please also provide an option in the ACP to add our own niche synonyms for niche forums (in addition to thesaurus dB for synonyms).
WebCMS Posted September 14 Author Posted September 14 (edited) You can integrate Google Search on your site using Google's infra as an alternative search option for accurate and blazing-fast searches (including single-char searches) and offer its submenu under Activity menu. Google Search also offers custom synonyms, custom autocomplete, search-result augmentation from web, sorting, refinements, promotions, query enhancements, search settings, page restrictions, image search, safe search, sites to search, excluded sites, restricted regions, language selection, etc. and also offers APIs - https://www.telugus.com/search-google Edited September 14 by WebCMS Joel R 1
aia Posted September 14 Posted September 14 On 2/16/2024 at 12:59 PM, Sonya* said: The sad part is that we can't use Elasticsearch's multilingual capabilities in IPS. Even your screenshot shows that we can only use one language analyzer, which is not suitable for multilingual websites 😢
Recommended Posts