Lee Craven Posted July 6, 2009

Why are the user agents not as simple as they were in 2.x.x? When I see a guest online, check the IP, and do a reverse lookup, it shows as Googlebot, yet it doesn't appear in my online list. So the bot is crawling, but the board doesn't say that 'Google' is online. I have it set to show in the online list and to allow the search engines to browse as members, yet 99% of the time they don't show. I think I've seen Google show as "online" maybe twice since IPB3 Final.

When I was using 2.x.x it always showed all the bots when they were online, and it was a simple list to import into the ACP. Now it's all this crazy regex stuff that I cannot for the life of me figure out.

Example: Google's regex is:

Googl(e|ebot)(-Image)?/

IPS has no documentation whatsoever on how to work with this new system, and I for one have NO clue. Any chance it can be made more user-friendly for the typical end user? All I want is to display the bots in the online list when they are on my site. I don't have access to raw logs, so I can't even find out the info from the logs. I've seen 4 different bots online now that are in the user agents list, yet they DO NOT show in the online list.
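For readers puzzling over the Google pattern quoted in the post above, here is a minimal Python sketch of what it accepts. The first two sample strings are user agents Google's crawlers have commonly sent, and the third is an illustrative browser string. The helper name `matches_google` is hypothetical, and case-sensitive matching is an assumption: the thread does not say how IP.Board applies the pattern.

```python
import re

# The pattern as quoted in the post, trailing slash included.
# Case-sensitive matching is an assumption, not confirmed IP.Board behavior.
GOOGLE_PATTERN = re.compile(r"Googl(e|ebot)(-Image)?/")

def matches_google(user_agent: str) -> bool:
    """Return True if the pattern occurs anywhere in the user agent string."""
    return GOOGLE_PATTERN.search(user_agent) is not None

samples = [
    # Regular web crawler: matched via "Googlebot/"
    "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
    # Image crawler: matched via "Googlebot-Image/"
    "Googlebot-Image/1.0",
    # Illustrative browser string: no "Googl" at all, so no match
    "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US) Firefox/3.5",
]

for ua in samples:
    print(matches_google(ua), ua)
```

Read piece by piece: `Googl` is literal text, `(e|ebot)` allows either "Google" or "Googlebot", `(-Image)?` makes the "-Image" suffix optional, and the final `/` requires a slash right after the name.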
Lee Craven Posted July 7, 2009 Author

I guess I'm the only one with this problem :(
Lee Craven Posted July 7, 2009 Author

Would I be wrong to say that editing the regex to 'googlebot.com' would work? I noticed a search engine called Cuil.com on my site, so I made a new user agent (search engine), entered 'cuil.com' in the regex field, and it now shows in the online list.
bfarber Posted July 7, 2009

Realistically, you should escape the "." with a backslash:

googlebot\.com

The problem is that Google uses more than one user agent, hence the more complicated regular expression.
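A small Python sketch may help separate two things that are easy to mix up here: the hostname a reverse IP lookup returns and the user agent string the crawler sends. It assumes, as the help article quoted later in this thread states, that the pattern is run against the user agent string. The sample hostname and user agent are not from this thread; they are commonly published examples. The helper `is_match` and the case-insensitive flag are illustrative choices, not IP.Board's actual code.

```python
import re

# Assumed sample inputs, not taken from this thread: a widely published
# Googlebot user agent string and a hostname of the kind a reverse IP
# lookup returns for Google's crawler.
USER_AGENT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
REVERSE_DNS_HOST = "crawl-66-249-66-1.googlebot.com"

def is_match(pattern: str, text: str) -> bool:
    """Case-insensitive search for pattern anywhere in text."""
    return re.search(pattern, text, re.IGNORECASE) is not None

# The escaped pattern matches the hostname but not the user agent,
# because the user agent contains "Googlebot/2.1" and "google.com",
# never the literal text "googlebot.com".
print(is_match(r"googlebot\.com", REVERSE_DNS_HOST))  # True
print(is_match(r"googlebot\.com", USER_AGENT))        # False

# A plain "googlebot" is present in the user agent itself.
print(is_match(r"googlebot", USER_AGENT))             # True
```

Under these assumptions, a domain-style pattern only works when the crawler happens to put that domain in its user agent string, which would explain why some bots are recognized and others are not.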
Lee Craven Posted July 8, 2009 Author

Brandon, so is it possible to add search engines the way I explained above? For example: 'bestsearchengine.com' with regex "bestsearchengine\.com". Could that be a universal basic method that would work in most cases? The regex info is so hard for me to grasp. And without access to the root logs I'm none the wiser, as my host DOES NOT allow them (GoDaddy can be a pain).
Fórum Jurídico Posted July 8, 2009

If you don't mind, I have a question regarding this subject. In previous versions, I was able to replace spider bots' names with an image. For example, I could show an image on the index page in place of the name when Googlebot was online. How can I do that in IPB3?
bfarber Posted July 8, 2009

[quote name='Lee Craven' date='07 July 2009 - 09:25 PM' timestamp='1247016346' post='1822110']
Brandon, so is it possible to add the search engines like I explained above with like eg: 'bestsearchengine.com' - regex= "bestsearchengine\.com" - could that be a universal basic method that would work in most cases? The regex info is so hard for me to grasp. And without having access to the root logs I'm none the wiser as my host DOES NOT allow them. (godaddy can be a pain).
[/quote]

Yes

[quote name='Fórum Jurídico' date='07 July 2009 - 10:14 PM' timestamp='1247019262' post='1822130']
If you don't mind, I have a question regarding this subject. In previous versions, I was able to replace spiderbots names with an image. For example, I could show in the index page something like this: when googlebot was online. How can I do that in IPB3?
[/quote]

Please use the resource forums for assistance modifying/customizing your board.
Lee Craven Posted July 8, 2009 Author

I tried cuil\.com and it didn't show. I changed it to cuil.com and it showed in the online list (it was recognized). I just don't understand the regex info. This is all too confusing; I wish it were simpler.
Lee Craven Posted July 10, 2009 Author

If anyone figures all this out, please contact me. Thanks.
cooldude7273 Posted July 11, 2009

Is there any way to add a mass amount of bots?
Chetan105 Posted December 27, 2009

Bump. This information is important for at least some of us. IPB, wake up and give us any info on the regex ASAP.
Mark Posted December 27, 2009

Have you tried clicking the "Get Help With This Page" link on the user agent page? It will take you here: http://community.invisionpower.com/resources/official.html?record=293

If you feel that article can be improved, you need to let us know what it's missing; that is what the comment feature is for. It does, however, answer the question being asked:

Regex - This is what IP.Board actually uses to detect if the user is using this user agent. It should be a regular expression which will be performed on the user agent string - if it matches, IP.Board will match the user with this user agent. For the simplest use, you can just enter in anything that will be in the user agent - for example, you could just enter "googlebot" for Google.

If you're interested in learning more about Regex, this is a good place to start.

With regards to "." vs "\." though: "." in Regex represents any character, so for the sake of detecting "bestsearchengine.com" it doesn't really matter if you enter "bestsearchengine.com" or "bestsearchengine\.com".
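Mark's point about "." versus "\." can be checked with a few lines of Python. This is only a sketch: both user agent strings are hypothetical, invented for an imaginary crawler, and Python's `re` module stands in for whatever regex engine IP.Board uses.

```python
import re

# "." matches any single character (except a newline, in Python).
loose = re.compile(r"bestsearchengine.com")
# "\." matches only a literal dot.
strict = re.compile(r"bestsearchengine\.com")

# Hypothetical user agent strings for an imaginary crawler.
real_bot = "BestBot/1.0 (+http://www.bestsearchengine.com/bot.html)"
lookalike = "Mozilla/5.0 (+http://bestsearchengine-com.example/)"

# Both patterns recognize the real bot.
print(bool(loose.search(real_bot)))    # True
print(bool(strict.search(real_bot)))   # True

# Only the unescaped pattern also accepts the lookalike, because its "."
# is satisfied by the "-" in "bestsearchengine-com".
print(bool(loose.search(lookalike)))   # True
print(bool(strict.search(lookalike)))  # False
```

So for detection the two forms behave the same on the real string; the escaped form is simply stricter about what else it lets through.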
This topic is now archived and is closed to further replies.