opentype Posted March 17, 2018 Posted March 17, 2018 2 minutes ago, PPlanet said: In a way it makes sense, but I suppose that Google warns me because it sees them in the sitemap yet can't access them. So change it if it bothers you. PPlanet 1
mark007 Posted March 17, 2018 Posted March 17, 2018 2 hours ago, PPlanet said: I have blocked search engines from accessing members profiles. Google sees this as an HTTP Error: 403 and keeps warning me about it. Are guests able to see the profiles?
mark007 Posted March 17, 2018 Posted March 17, 2018 By the way ... Google says: Indexed and not in sitemap transmitted
PPlanet Posted March 18, 2018 Posted March 18, 2018 7 hours ago, mark007 said: Are guests able to see the profiles? No, they are not. Cheers.
mark007 Posted March 18, 2018 Posted March 18, 2018 5 hours ago, PPlanet said: No, they are not. Cheers. 15 hours ago, PPlanet said: In a way it makes sense, but I suppose that Google warns me because it sees them in the sitemap yet can't access them. That's strange. If guests can not see profiles, then they are not included in the sitemap - at least not for me.
mark007 Posted March 18, 2018 Posted March 18, 2018 On 16.2.2018 at 8:45 PM, bfarber said: I wouldn't nofollow internal embeds (those are essentially links to other pages on your site which you absolutely do want to be followed and page rank passed to), however if you visit that link and view the page source, a canonical tag is already set to the real topic URL, so Google should be able to follow that link and index it, but point back to the canonical URL. But what about these links? topic/*?do=findComment* topic/*tab=comments* *?page=1$ *?page=0$ *?view=getnextunread* *?do=getNewComment *?do=getLastComment *?do=reportComment* In robots.txt I set now the disallow directive. Does it make sense? And should not IPS set a redirect to ?page=1 and ?page=0 to the topic ending with / ... ? Because Google tells me, that there are 3 different URL's with the same content (duplicate content).
mark007 Posted March 18, 2018 Posted March 18, 2018 On 15.3.2018 at 8:52 AM, Upgradeovec said: But one more interesting thing - i added robots.txt 2018-03-03 and graph 'index count' boost x3.. Just because i block some links, which google tried to get, but get error or get content-less page. This errors may worse indexing of correct links. May be not, who knows Would you post your robots.txt?
opentype Posted March 18, 2018 Posted March 18, 2018 29 minutes ago, mark007 said: That's strange. If guests can not see profiles, then they are not included in the sitemap - at least not for me. It was explained already. A third-party app is used to block the profile pages, independent from the IPS core settings.
Duken Posted March 18, 2018 Posted March 18, 2018 On 9-1-2018 at 9:18 AM, Upgradeovec said: Did it. Before: After: No issues detected by several sitemap online checking tools: I did it very ugly. Just for try and check. You can improve it by yourself (and share it with us, please): /applications/core/extensions/core/Sitemap/Content.php line 209: after $data line add that: if (get_class($node) === 'IPS\forums\Forum' && isset($node->last_post)) { $data['lastmod'] = $node->last_post; } and line 259 (line 262 after add previous) add after $data line that: if (get_class($item) === 'IPS\forums\Topic' && isset($item->last_post)) { $data['lastmod'] = $item->last_post; } After that the sitemap script should re-generate all sub-sitemaps for write new data to db. And I haven't done changing correct lastmod in index sitemap, depended on newer date inside sub-sitemap. Thanks. How did you changed this? My content.php from line 209: $data = array( 'url' => $node->url() ); $priority = intval( isset( $settings["sitemap_{$nodeClass::$nodeTitle}_priority"] ) ? $settings["sitemap_{$nodeClass::$nodeTitle}_priority"] : self::RECOMMENDED_NODE_PRIORIY ); if ( $priority !== -1 ) { $data['priority'] = $priority; $entries[] = $data; } } } } So i need to add: if (get_class($node) === 'IPS\forums\Forum' && isset($node->last_post)) { $data['lastmod'] = $node->last_post; } Like this? 209: $data = array( 'url' => $node->url() ); 209: $data = if (get_class($node) === 'IPS\forums\Forum' && isset($node->last_post)) { $data['lastmod'] = $node->last_post; }
mark007 Posted March 18, 2018 Posted March 18, 2018 No, you have to add the code after the line: After: $data = array( 'url' => $node->url() ); if (get_class($node) === 'IPS\forums\Forum' && isset($node->last_post)) { $data['lastmod'] = $node->last_post; } Numbered 1
AlexWebsites Posted March 18, 2018 Posted March 18, 2018 2 hours ago, mark007 said: No, you have to add the code after the line: After: $data = array( 'url' => $node->url() ); if (get_class($node) === 'IPS\forums\Forum' && isset($node->last_post)) { $data['lastmod'] = $node->last_post; } @mark007 Has this worked with getting more of your sitemap indexed? Would have been nice if they just worked this in to 4.2.8.
mark007 Posted March 20, 2018 Posted March 20, 2018 On 18.3.2018 at 10:35 PM, AlexWebsites said: Has this worked with getting more of your sitemap indexed? I didn't implement it ...
sadams101 Posted March 29, 2018 Posted March 29, 2018 I did implement all mods and am seeing positive results since I did it. At a low point I had 111,000 indexed, which really makes no sense at all given that I have nearly 1M posts, well over 100K topics, and an article site also that has 5K articles. As you can see it is going up fast now, and is up to 187,000. Since there are built in canonical links, I would not block anything in the robots.txt file: sudo, CSSlife, SeNioR- and 1 other 3 1
Duken Posted April 14, 2018 Posted April 14, 2018 On 3/30/2018 at 1:48 AM, sadams101 said: I did implement all mods and am seeing positive results since I did it. At a low point I had 111,000 indexed, which really makes no sense at all given that I have nearly 1M posts, well over 100K topics, and an article site also that has 5K articles. As you can see it is going up fast now, and is up to 187,000. Since there are built in canonical links, I would not block anything in the robots.txt file: Hi, could you tell is how the index is going at the moment? If its good ill implement is alsof. Thx.
Dll Posted April 14, 2018 Posted April 14, 2018 On 3/17/2018 at 3:21 PM, mark007 said: There are a few problems in indexing pages: https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/?page=0 is the same as https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/?page=1 and the same as https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/ There isn't a problem with those pages, as the canonical is set to the correct one on all of them: <link rel="canonical" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/" /> bfarber and Daniel F 2
Duken Posted April 15, 2018 Posted April 15, 2018 (edited) Will the sitemap generation be changed in 4.3? If it is, ill wait for the update. - Google was / is removing links from the index on my site to. Reactions on (older) forum topics are taking forever to be indexed. Edited April 15, 2018 by Duken added index
sadams101 Posted April 17, 2018 Posted April 17, 2018 (edited) POST EDITED: I removed my rant...I looked at the incorrect link and posted that the pagination was not included in the canonical links, but I was incorrect, it is included... PS - There is an issue with the mycustomsiteupdater.php file here that causes too many resources to be used, so I would not use it. The mod that seems to have done the trick is the <lastmod> date that was added early in this thread. Edited April 17, 2018 by sadams101 Incorrect info.
Dll Posted April 17, 2018 Posted April 17, 2018 (edited) 2 hours ago, sadams101 said: I disagree with the canonical link solving this issue. In fact, the canonical link is simply wrong here. IT SHOULD SAY ?page=1, 2, etc., because the unique information on all of those unique pages that are being indexed is being sent to the wrong place. <snip> Errm, not sure what you're looking at there, as it's all set correctly. As per your original example, if there was a link to page 0 or page 1, they have a canonical of: <link rel="canonical" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/" /> Which makes sense, since those are all essentially the first page of the thread, so don't require (and shouldn't have) the pagination in the url for seo purposes. So all good so far. But for page 2, the canonical is: <link rel="canonical" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/?page=2" /> Page 3 has a canonical to ?page=3, and so on. On top of that, Invision also have the tags to let google know it's a paginated thread, which again is good seo, as google then knows to link the pages together as one set, and it may also show the page links in search. For instance on page 2, the tags are: <link rel="first" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/" /> <link rel="prev" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/" /> <link rel="next" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/?page=3" /> <link rel="last" href="https://invisioncommunity.com/forums/topic/442742-large-community-you-have-a-problems-with-sitemap/?page=8" /> So no need to rant, and no need to get someone to fix it for you, as it's already been done ? Edited April 17, 2018 by Dll
sadams101 Posted April 17, 2018 Posted April 17, 2018 (edited) I edited my first reply...I do see the pagination on the later category pages, so my bad! I must have been looking at the wrong link in view page...sorry! Is anyone else using the sitemap cronjob at the start of this thread? It creates some memory issues, even though I have 130Gigs. Edited April 17, 2018 by sadams101
ProSkill Posted April 27, 2018 Posted April 27, 2018 Anyone have updates to report after moving to 4.3? I noticed that the number of links in my sitemap is now significantly smaller. I am not sure why that is, I ran the sitemap rebuild function. It's too early to notice any ranking changes.
Fast Lane! Posted April 27, 2018 Posted April 27, 2018 Scary. This is why I generally wait a few point releases to upgrade. Sounds like many bugs are abound on 4.3 which isn't surprising for a large update.
sadams101 Posted May 2, 2018 Posted May 2, 2018 I know this was pointed out already, but links like this are not in the site map: https://www.celiac.com/forums/topic/102448-what-brand-of-pasta-do-you-all-buy/?page=2 and google is throwing warnings about this in their "Indexed, not submitted in sitemap" area. Is there a reason these are not in the sitemap? Is all the content in the sitemap anyway, so including the pagination doesn't matter? PS - I am not yet on 4.3...is this why I can't find the settings mentioned that allow the profiles to be searchable/indexable by google? sobrenome 1
bfarber Posted May 2, 2018 Posted May 2, 2018 12 hours ago, sadams101 said: I know this was pointed out already, but links like this are not in the site map: https://www.celiac.com/forums/topic/102448-what-brand-of-pasta-do-you-all-buy/?page=2 and google is throwing warnings about this in their "Indexed, not submitted in sitemap" area. Is there a reason these are not in the sitemap? Is all the content in the sitemap anyway, so including the pagination doesn't matter? PS - I am not yet on 4.3...is this why I can't find the settings mentioned that allow the profiles to be searchable/indexable by google? We only include the first page in the sitemap. Google is able to index the rest of the pages (as you see there).
AlexWebsites Posted May 6, 2018 Posted May 6, 2018 On 1/16/2018 at 5:56 AM, Matt said: We've added the timestamp into the sitemap and we're looking to add a tool to quickly rebuild the sitemap on demand. @Matt is there a tool in 4.3 to rebuild sitemaps on demand? I didn't see one. I updated one of my sites to 4.3 and do not see timestamps for forum topics, assuming it just needs to be rebuilt after upgrade.
Recommended Posts