<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wikidot="http://www.wikidot.com/rss-namespace">

	<channel>
		<title>Minor outage earlier today</title>
		<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today</link>
		<description>Posts in the discussion thread &quot;Minor outage earlier today&quot;</description>
				<copyright></copyright>
		<lastBuildDate></lastBuildDate>
		
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-510539</guid>
				<title>Re: Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-510539</link>
				<description></description>
				<pubDate>Tue, 16 Jun 2009 18:10:11 +0000</pubDate>
				<wikidot:authorName>cold_blood3d</wikidot:authorName>				<wikidot:authorUserId>90994</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>thanks. i was happy it didn't last too long.</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-510475</guid>
				<title>Re: Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-510475</link>
				<description></description>
				<pubDate>Tue, 16 Jun 2009 16:59:02 +0000</pubDate>
				<wikidot:authorName>bruteginkiller</wikidot:authorName>				<wikidot:authorUserId>331367</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Thanks for the info! i dont think i was on then so lucky i wasnt, i get angry very fast try not to but….it happens</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-406046</guid>
				<title>Re: Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-406046</link>
				<description></description>
				<pubDate>Fri, 06 Mar 2009 09:06:44 +0000</pubDate>
				<wikidot:authorName>michal frackowiak</wikidot:authorName>				<wikidot:authorUserId>1</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>There was one more minor outage yesterday at 21.40 UTC, but this gave us more details of what is going on. It looks like we are suffering from a bug in PHP FastCGI interface that leaves dysfunctional PHP processes, that produce "500 Internal Server Error" and leave a lot of resources open: connections to cache, database, files etc. The problem escalated really fast and effective lead to front server stopping responding, which leads to an internally-caused Denial of Service.</p> <p>We have modified the webserver configurations to: 1. prevent situations which lead to creating dysfunctional processes, 2. if it does not work, each such emergency situation is now detected automatically and fixed so that no escalation takes place, thus the server is auto-healing.</p> <p>With Wikidot growing, we often meet new frontiers and new problems. Most of them we fix or adjust to without any service interruption, but some (as this one that results from a bug in software we are using) manifests in a very nasty way. Fortunately we have a really bright team here that can quickly diagnose, react to problems and most importantly prevent problems from occurring in the first place.</p> <p>So I hope we have this one closed. Thanks for all the friendly support emails from you, Twitter posts and comments we have received! We are really happy to provide Wikidot services and we are devoted to it, and also are glad that our efforts are recognized!</p> <p>EDIT: we are looking at some future-proof improvements to our infrastructure, I put more notes <a href="http://michalfrackowiak.com/blog:wikidot-on-cloud">on my blog</a>.</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-404230</guid>
				<title>Re: Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-404230</link>
				<description></description>
				<pubDate>Wed, 04 Mar 2009 19:27:10 +0000</pubDate>
				<wikidot:authorName>michal frackowiak</wikidot:authorName>				<wikidot:authorUserId>1</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Thanks Helmuti,</p> <p>first of all, we do have "dynamic IP addresses" which we can assign to different servers when needed. In case one front-end server dies, we can easily assign the IP to another server and either serve a read-only content from Wikidot, or make a backup server display a message.</p> <p>This time however we had a problem with it — I am not sure how this works (Piotr is much better at this) but the router in the datacenter did not want to route traffic properly. Perhaps we were doing something wrong, but it did work several times before that. This is why we got the message a bit later, and for a while no servers could be reached at all.</p> <p>There was also a suggestion to set up an external site that would report status of Wikidot.com, which is also a good idea.</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-404184</guid>
				<title>Re: Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-404184</link>
				<description></description>
				<pubDate>Wed, 04 Mar 2009 18:32:33 +0000</pubDate>
				<wikidot:authorName>Helmuti_pdorf</wikidot:authorName>				<wikidot:authorUserId>17609</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Thanks for the info!</p> <p>Question: would it help to set up a "short notice" on one of the "reachable" google.groups ( our "temporary wikidot dev-list" ?)</p> <p>This link is readable for non-signed in visitors too: <a href="http://groups.google.com/group/wikidot?hl=en" >http://groups.google.com/group/wikidot?hl=en</a></p> <p>at this link I asked why (and if !) wikidot is not reachable - could be my provider has lost some DNS . . , my router is damaged …, my browser has problems.. ,</p> <p>(I for my own tried first www.openDNS,com to check if my provider has problems)</p> <p>I got the info from piotr ( by mail) - the team is working on a strange failure.. that helped realy!</p> <p>I finished my experiments to reach wikidot - and made a pause…</p> 
				 	]]>
				</content:encoded>							</item>
					<item>
				<guid>http://www.wikidot.com/forum/t-135576#post-404042</guid>
				<title>Minor outage earlier today</title>
				<link>http://www.wikidot.com/forum/t-135576/minor-outage-earlier-today#post-404042</link>
				<description></description>
				<pubDate>Wed, 04 Mar 2009 17:14:29 +0000</pubDate>
				<wikidot:authorName>michal frackowiak</wikidot:authorName>				<wikidot:authorUserId>1</wikidot:authorUserId>				<content:encoded>
					<![CDATA[
						 <p>Unfortunately we had a minor outage earlier today, exactly between 14.30 and 16.00 UTC. The problem was caused probably by an unexpected DOS attack that consumed resources of our main server, but we are still investigating it. We started working on this at 14.31. Although we did cope with the issue quite quickly, our freshly-rebooted storage array forced an all-disk check (fsck) which took more than an hour. Since the reboot was not scheduled, it halted the whole service from being taken on-line, and enormously prolonged the outage.</p> <p>We were positing status updates in real-time.</p> <p>Sorry about any inconvenience. We know people rely on Wikidot for many of their activities (we got a lot of phone-calls too), this is why we always try to prevent such incidents and fix them as soon as possible. Thanks for your understanding!</p> <p>Best regards,<br /> The Wikidot Team</p> 
				 	]]>
				</content:encoded>							</item>
				</channel>
</rss>