Jump to content

Suggestion: Easier User Agents


Lee Craven

Recommended Posts

Posted

Why are the user agents not simple like they were in 2.x.x? I cannot for the life of me figure out why when I see I have a guest online check the IP and reverse lookup the IP it shows as googlebot, yet it doesn't show in my online list, so the bot is crawling yet it doesn't say that 'Google' is online, I have it set to show in the online list, and allow the search engines to browse as members yet 99% of the time they don't show in the online list, I think I've seen google show as "online" maybe twice since IPB3 Final. When I was using 2.x.x it always showed all the bots when they were online, and it was a simple list to import into the ACP, now it's all this crazy regex stuff that I cannot for the life of me figure out...

Example:

Googles Regex is:


IPS have no documentation what so ever on how to work with this new system, I for one have NO clue, any chance it can be more user friendly for the typical end user, all I want is to display the bots in the online list when they are on my site?

I don't have access to raw logs so I can't even find out the info from the logs, I've seen 4 different bots online now that are in the user agents list yet they DO NOT show in the online list.

Googl(e|ebot)(-Image)?/
Posted

Would I be wrong to say editing the regex to say 'googlebot.com' would work? I noticed a search engine called Cuil.com on my site, I made a new user agent (search engine) and entered 'cuil.com'in the regex and it shows in the online list.

Posted

Realistically, you should escape the "." with a backslash.

googlebot\.com

The problem is, google uses more than one user agent, hence the more complicated regular expression.

Posted

Brandon, so is it possible to add the search engines like I explained above with like eg: 'bestsearchengine.com' - regex= "bestsearchengine\.com" - could that be a universal basic method that would work in most cases?

The regex info is so hard for me to grasp. And without having access to the root logs I'm none the wiser as my host DOES NOT allow them. (godaddy can be a pain).

Posted

If you don't mind, I have a question regarding this subject.

In previous versions, I was able to replace spiderbots names with an image. For example, I could show in the index page something like this: logo_google_index.gif when googlebot was online.

How can I do that in IPB3?

Posted

[quote name='Lee Craven' date='07 July 2009 - 09:25 PM' timestamp='1247016346' post='1822110']
Brandon, so is it possible to add the search engines like I explained above with like eg: 'bestsearchengine.com' - regex= "bestsearchengine.com" - could that be a universal basic method that would work in most cases?

The regex info is so hard for me to grasp. And without having access to the root logs I'm none the wiser as my host DOES NOT allow them. (godaddy can be a pain).


Yes


[quote name='Fórum Jurídico' date='07 July 2009 - 10:14 PM' timestamp='1247019262' post='1822130']
If you don't mind, I have a question regarding this subject.

In previous versions, I was able to replace spiderbots names with an image. For example, I could show in the index page something like this: when googlebot was online.

How can I do that in IPB3?


Please use the resource forums for assistance modifying/customizing your board.

Posted

I tried having cuil\.com and it didn't show, I changed it to cuil.com and it showed in the online list (was recognized) I just don't understand the regex info.

This is all too confusing, wish it was more simple.

  • 5 months later...
Posted

Have you tried clicking the "Get Help With This Page" link on the user agent page?
It will take you to here: http://community.invisionpower.com/resources/official.html?record=293

If you feel that article can be improved, you need to let us know what it's missing... that is what the comment feature is for.

Although it does answer the question being asked:


If you're interested in learning more about Regex, this is a good place to start.
With regards to "." vs "\." though - "." in Regex represents any character, so for the sake of detecting "bestsearchengine.com" - it doesn't really matter if you enter "bestsearchengine.com" or "bestsearchengine\.com".

Regex - This is what IP.Board actually uses to detect if the user is using this user agent. It should be a regular expression which will be performed on the user agent string - if it matches, IP.Board will match the user with this user agent. For the simplest use, you can just enter in anything that will be in the user agent - for example, you could just enter "googlebot" for Google.

Archived

This topic is now archived and is closed to further replies.

  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...