Some of you need to realize this could've happened to any place where the site is being hosted. Just because it happened with Curse doesn't make it impossible to happen anywhere else. It was extremely bad luck, and now they know how to prepare for it better. This will be a learning experience for them. Nobody is perfect, and all it takes it one bad day (or a couple of bad days) to learn how to prepare for the future.
-------------
"Why did Curse administrators take the long way around and take out four days of site usability instead of immediately getting working copies of sites up so at least they were usable?"
You have to remember that the Minecraft forums + wiki aren't the only sites they are hosting. It will take time to move all of their sites over to a different server. This will include files and the database. Then, they'll have to edit each of their sites to make them work correctly with the new server, this includes things like database connections.
Most likely the backup server won't be as powerful as their current one, so they'll have to go through all of their sites and strip various features that could add pressure to the server.
This is all assuming they have a backup server.
Their future plans should invest into more than one data center + servers and have mirror copies of their sites. This way, if one place fails (hardware, power failure, weather), the other mirror sites should still be working fine.
Well I must say they are pretty dumb, if their backup servers don't work either. Who was checking those things?
Stuff happens. It is called real life and if you had bothered to actually read the OP you would realize that there were multiple failures including stuff that was completely beyond their control. More to the point, once the whole network of servers went down, they didn't dare simply flip the switch to put everything back on right away.
I, for one, am very glad that they took their time to do it right. Had they not done so, posts would have been lost or much more bitching about how various parts of the forums weren't working or pages missing from the wiki and all sorts of other potential problems could have happened.
The only part of the summary that I find a little bit offensive is the claim it only took 24 hours to fix the problems. The site has been going up and down for much longer than just one day. Futhermore, I don't really understand why other parts of the "CURSE Network" came up but the Minecraft stuff didn't. I'm sure some of that is due to the size of the Minecraft community and because so much of the community is "officially" using these servers. Just be honest with us, and don't try to sugar coat what must have been a network technician's worst nightmare come true. Don't try to downplay the extent of the problems.
Also for the kiddies to note, this was also a financial disaster in terms of not just having to replace equipment (which costs $$$) and a whole bunch over technician overtime (even more $$$), but it also cost advertising revenue (still more $$$) not to mention a loss of goodwill (where many in the community may go elsewhere). I'm sure these guys are completely aware of that, but give them a break and don't go rubbing it into their face.
Minecraft should stay away from that sh*tty curse network. they got their fingers in their asses all day long, while minecraft is actually attempting to get somewhere...
Ditch CURSE, As a Pre-Paying owner of minecraft, i like to be able to access the forums when i need to....Not when f- curse say so...
For those who think that CURSE "owes" you something because you paid for Minecraft, that is pure and utter BS on your part. You paid for the game, not forum access. Did anybody ever have problems actually playing the game during this time period? I thought not. Your "pre-paying owner of Minecraft" has nothing to do with this whole thing so crawl back under your rock and ditch that holier than thou attitude. Mojang did not promise you anything in terms of forum access when you purchased this game.
You can "Ditch CURSE" if you care, because there are other forum communities out there. Look them up, and I for one would be glad if somebody who is so petty like this would leave the "community" here on these forums on CURSE. Don't let the door slam you in the behind as you leave. I'm not going to miss people with an attitude like this.
Oh,so it was a failer...Me and my buddy on his server thought that the wiki and forums we're down cause of it might being an update to 1.7,so it was just changing...XD!Well it's good to know about what REALLY happend.
Well,to find out what version you have,just go to the Title Screen (The screen with options and where it says MINECRAFT), and it should be right at the Top Left Corner...
_
/ \
|
| + /
|------
\
and yes the LEFT arrow looks like a tilted Pick-Axe
Well, glad the site's back up anyway. My question is, Did lulzsec play a part in this? I read that lulzsec was screwing around with some kind of minecraft-related thing. I am aware that it wasn't just the minecraft areas of Curse that were effected, but I haven't been able to find anything else about Minecraft that recently had downtime.
Rollback Post to RevisionRollBack
NO. Don't click New Thread. Read the rules. I don't give a damn how long or boring they are. If you do not, you will be subject to massive flaming and criticism by yours truly.
The reason that Minecraft Forum and Minecraft Wiki were so late to come up is because after a 40 hour check disk (data and hardware integrity check) we noticed our Linux servers (which the MC sites run on) couldn't access the SAN array. In order to give them access we were told by our hardware vendor we needed to update the firmware for our SAN infrastructure. In order to do that we wanted to back everything up off of it just in case something went wrong. This was the 2nd part that took so long.
After doing all of that we were finally able to get our Linux servers access to their data and have them running again.
It wasn't as simple as just having a backup, it wasn't as simple as just copying files. I know from the outside it may seem that way but if it was we would've had the site up long ago.
The only part of the summary that I find a little bit offensive is the claim it only took 24 hours to fix the problems. The site has been going up and down for much longer than just one day. Futhermore, I don't really understand why other parts of the "CURSE Network" came up but the Minecraft stuff didn't. I'm sure some of that is due to the size of the Minecraft community and because so much of the community is "officially" using these servers. Just be honest with us, and don't try to sugar coat what must have been a network technician's worst nightmare come true. Don't try to downplay the extent of the problems.
The 24 hour figure was the 24 hours of Windows running check disk on a many terabyte RAID 50 array.(This was one portion of the disks. See Xtek's post above for the full time.)
After about 48 hours we were able to bring the Windows .NET sites online. They were able to mount their volumes from the SAN. The bug listed was only affecting our Linux based sites which includes MCF.
There are three of our top technicians taking a very long sleep out in Atlanta right now after what certainly was a technician's worst nightmare. Plus the five or so hardware company representatives that were there or on the phone.
Well, glad the site's back up anyway. My question is, Did lulzsec play a part in this? I read that lulzsec was screwing around with some kind of minecraft-related thing. I am aware that it wasn't just the minecraft areas of Curse that were effected, but I haven't been able to find anything else about Minecraft that recently had downtime.
No, they had nothing to do with this. This was entirely a hardware failure.
Also, guys, realize that a vast majority of the Curse sites were online in Read-Only mode shortly after the servers went down. However, a site like MinecraftForums can't benefit from a read only mode, as forums write data to disks.
It's just a messy, annoying, and ironic failure. There's nothing anyone could have done to predict it, and they acted as fast as they possibly could to get the site back.
Also, realize this could have ended a lot worse. Say the HDD array failed so bad that we lost data. That would be bad, and chances are, we'd be down significantly longer than we were.
Also, you could still play Minecraft while the forums were down. You paid for the game, not unrestricted access to the forums and website.
Welcome back guys! I do have a few questions that maybe someone knowing about these server array thingamabobs could answer:
1. How likely is it that this sort out of equipment malfunction would occur in the first place?
2. Why did Curse administrators take the long way around and take out four days of site usability instead of immediately getting working copies of sites up so at least they were usable?
3. Would it be possible to shorten the time needed to restore and reinstall everything if the same hardware failure should occur again?
4. In the foreseeable future is there anything Curse could do about making site backups and restorations less dependent on critical hardware components like the ones which failed?
5. It seems possible from reading about the hardware that failed, Curse could remotely store a copy of their entire Atlanta datacentre at one or more other datacentres. Would they be able to put together some plan to do this on a regular basis, and to switch sites if such a failure occurs again?
6. Is there anything Curse could do to prevent a similar failure from taking out so many sites at once? Having a redundancy system that depends on a few critical components seems a lot like putting too many eggs in one basket to me!
And finally a question for the Minecraft forums staff: Could they please look into having the Minecraft Wiki site hosted on a different set of servers? It takes a lot of time to learn about redstone circuitry and I can't learn much while the wiki is down on the mat for four days!
Cheers ...
BrickVoid
1. Unlikely. Hardware is, though, prone to malfunctioning as well as firmware and other software.
2. Because, if you read the post, the SAN server malfunctioned, backup launched and malfunctioned, then so did the support NAS servers. there was no way they could've hosted it.
3. Depends on the issue, as this time it had much to do with the firmware provided by the provider of said hardware.
4. Most probably not.
5. Curse is, after all, not a company with millions of dollars and hundreds of employees. It's simply financially impossible to do such a thing.
6. Unless they reroute all of their structure, no.
Minecraft should stay away from that sh*tty curse network. they got their fingers in their asses all day long, while minecraft is actually attempting to get somewhere...
Ditch CURSE, As a Pre-Paying owner of minecraft, i like to be able to access the forums when i need to....Not when f- curse say so...
I don't have anything to say to this. Mojang has no affiliation with the MinecraftForums. They were origininally set up by Quatroking, as well as the wiki. Curse provides quite a relief for the hosting, and it can handle a LOT more traffic now. If you were here in the early stages of launching you would've known how much better the handling is now. I suggest you actually try to host a site with thousands of visitors daily, and then for 24 sites. Think before you put down **** about a "company" That is willing to provide hosting services for a forum for a game, and incorporating it into their network. This is the first time such a thing has happened.
Quote from ShatteredLight »
Well, glad the site's back up anyway. My question is, Did lulzsec play a part in this? I read that lulzsec was screwing around with some kind of minecraft-related thing. I am aware that it wasn't just the minecraft areas of Curse that were effected, but I haven't been able to find anything else about Minecraft that recently had downtime.
No, lulzsec has other things on their hands as far as I'm concerned.
-------------
Some of you need to realize this could've happened to any place where the site is being hosted. Just because it happened with Curse doesn't make it impossible to happen anywhere else. It was extremely bad luck, and now they know how to prepare for it better. This will be a learning experience for them. Nobody is perfect, and all it takes it one bad day (or a couple of bad days) to learn how to prepare for the future.
-------------
"Why did Curse administrators take the long way around and take out four days of site usability instead of immediately getting working copies of sites up so at least they were usable?"
You have to remember that the Minecraft forums + wiki aren't the only sites they are hosting. It will take time to move all of their sites over to a different server. This will include files and the database. Then, they'll have to edit each of their sites to make them work correctly with the new server, this includes things like database connections.
Most likely the backup server won't be as powerful as their current one, so they'll have to go through all of their sites and strip various features that could add pressure to the server.
This is all assuming they have a backup server.
Their future plans should invest into more than one data center + servers and have mirror copies of their sites. This way, if one place fails (hardware, power failure, weather), the other mirror sites should still be working fine.
Stuff happens. It is called real life and if you had bothered to actually read the OP you would realize that there were multiple failures including stuff that was completely beyond their control. More to the point, once the whole network of servers went down, they didn't dare simply flip the switch to put everything back on right away.
I, for one, am very glad that they took their time to do it right. Had they not done so, posts would have been lost or much more bitching about how various parts of the forums weren't working or pages missing from the wiki and all sorts of other potential problems could have happened.
The only part of the summary that I find a little bit offensive is the claim it only took 24 hours to fix the problems. The site has been going up and down for much longer than just one day. Futhermore, I don't really understand why other parts of the "CURSE Network" came up but the Minecraft stuff didn't. I'm sure some of that is due to the size of the Minecraft community and because so much of the community is "officially" using these servers. Just be honest with us, and don't try to sugar coat what must have been a network technician's worst nightmare come true. Don't try to downplay the extent of the problems.
Also for the kiddies to note, this was also a financial disaster in terms of not just having to replace equipment (which costs $$$) and a whole bunch over technician overtime (even more $$$), but it also cost advertising revenue (still more $$$) not to mention a loss of goodwill (where many in the community may go elsewhere). I'm sure these guys are completely aware of that, but give them a break and don't go rubbing it into their face.
For those who think that CURSE "owes" you something because you paid for Minecraft, that is pure and utter BS on your part. You paid for the game, not forum access. Did anybody ever have problems actually playing the game during this time period? I thought not. Your "pre-paying owner of Minecraft" has nothing to do with this whole thing so crawl back under your rock and ditch that holier than thou attitude. Mojang did not promise you anything in terms of forum access when you purchased this game.
You can "Ditch CURSE" if you care, because there are other forum communities out there. Look them up, and I for one would be glad if somebody who is so petty like this would leave the "community" here on these forums on CURSE. Don't let the door slam you in the behind as you leave. I'm not going to miss people with an attitude like this.
Version 2.1 now updated for MC 1.6.2
Support my mod!
Sweet sally in the ally
hey i just downloaded minecraft and was wanting to get better texture how do look up what mindcraft im on 1.6 or 1.5 where can i find this info
~StoneIronPIG! AKA,P0rtal
_
/ \
|
| + /
|------
\
and yes the LEFT arrow looks like a tilted Pick-Axe
After doing all of that we were finally able to get our Linux servers access to their data and have them running again.
It wasn't as simple as just having a backup, it wasn't as simple as just copying files. I know from the outside it may seem that way but if it was we would've had the site up long ago.
The 24 hour figure was the 24 hours of Windows running check disk on a many terabyte RAID 50 array.(This was one portion of the disks. See Xtek's post above for the full time.)
After about 48 hours we were able to bring the Windows .NET sites online. They were able to mount their volumes from the SAN. The bug listed was only affecting our Linux based sites which includes MCF.
There are three of our top technicians taking a very long sleep out in Atlanta right now after what certainly was a technician's worst nightmare. Plus the five or so hardware company representatives that were there or on the phone.
I have been awake for nearly 24 hours now.
No, they had nothing to do with this. This was entirely a hardware failure.
It's just a messy, annoying, and ironic failure. There's nothing anyone could have done to predict it, and they acted as fast as they possibly could to get the site back.
Also, realize this could have ended a lot worse. Say the HDD array failed so bad that we lost data. That would be bad, and chances are, we'd be down significantly longer than we were.
Also, you could still play Minecraft while the forums were down. You paid for the game, not unrestricted access to the forums and website.
1. Unlikely. Hardware is, though, prone to malfunctioning as well as firmware and other software.
2. Because, if you read the post, the SAN server malfunctioned, backup launched and malfunctioned, then so did the support NAS servers. there was no way they could've hosted it.
3. Depends on the issue, as this time it had much to do with the firmware provided by the provider of said hardware.
4. Most probably not.
5. Curse is, after all, not a company with millions of dollars and hundreds of employees. It's simply financially impossible to do such a thing.
6. Unless they reroute all of their structure, no.
I don't have anything to say to this. Mojang has no affiliation with the MinecraftForums. They were origininally set up by Quatroking, as well as the wiki. Curse provides quite a relief for the hosting, and it can handle a LOT more traffic now. If you were here in the early stages of launching you would've known how much better the handling is now. I suggest you actually try to host a site with thousands of visitors daily, and then for 24 sites. Think before you put down **** about a "company" That is willing to provide hosting services for a forum for a game, and incorporating it into their network. This is the first time such a thing has happened.
No, lulzsec has other things on their hands as far as I'm concerned.
-novice pixel artist and storywriter-
Actually play minecraft/APB/whatever (Play APB.)
-novice pixel artist and storywriter-