You can subscribe to this list here.
2010 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(2) |
Jun
|
Jul
|
Aug
(6) |
Sep
|
Oct
(19) |
Nov
(1) |
Dec
|
---|---|---|---|---|---|---|---|---|---|---|---|---|
2011 |
Jan
(12) |
Feb
(1) |
Mar
(4) |
Apr
(4) |
May
(32) |
Jun
(12) |
Jul
(11) |
Aug
(1) |
Sep
(6) |
Oct
(3) |
Nov
|
Dec
(10) |
2012 |
Jan
(11) |
Feb
(1) |
Mar
(3) |
Apr
(25) |
May
(53) |
Jun
(38) |
Jul
(103) |
Aug
(54) |
Sep
(31) |
Oct
(66) |
Nov
(77) |
Dec
(20) |
2013 |
Jan
(91) |
Feb
(86) |
Mar
(103) |
Apr
(107) |
May
(25) |
Jun
(37) |
Jul
(17) |
Aug
(59) |
Sep
(38) |
Oct
(78) |
Nov
(29) |
Dec
(15) |
2014 |
Jan
(23) |
Feb
(82) |
Mar
(118) |
Apr
(101) |
May
(103) |
Jun
(45) |
Jul
(6) |
Aug
(10) |
Sep
|
Oct
(32) |
Nov
|
Dec
(9) |
2015 |
Jan
(3) |
Feb
(5) |
Mar
|
Apr
(1) |
May
|
Jun
|
Jul
(9) |
Aug
(4) |
Sep
(3) |
Oct
|
Nov
|
Dec
|
2016 |
Jan
(3) |
Feb
|
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2017 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(3) |
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
2018 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(4) |
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
S | M | T | W | T | F | S |
---|---|---|---|---|---|---|
|
|
|
|
1
|
2
(3) |
3
|
4
|
5
(5) |
6
(22) |
7
(1) |
8
(4) |
9
|
10
|
11
|
12
|
13
(4) |
14
(6) |
15
(5) |
16
(18) |
17
|
18
|
19
(8) |
20
(1) |
21
|
22
|
23
|
24
|
25
|
26
|
27
|
28
|
29
|
30
|
|
From: Vladimir S. <vst...@gm...> - 2012-11-05 16:26:48
|
On Mon, Nov 05, 2012 at 07:22:54PM +0300, Vladimir Stavrinov wrote: > solution is alliance of drbd + pacemaker + corosync. It is very Forgot to add in this team ipvs. ***************************** ### Vladimir Stavrinov ### vst...@gm... ***************************** |
From: Vladimir S. <vst...@gm...> - 2012-11-05 16:23:02
|
On Tue, Oct 30, 2012 at 10:02 PM, Roger Mayes <rog...@gm...> wrote: > Odd that their list publishes our email addresses, and if I hit "reply" (in > gmail), it goes to the person who made the post rather than to the list. Click arrow on the right of "Reply" button, then click "Reply all". Or copy/past list address into CC: field. > gmail is more-or-less commercial software, isn't it? Lol. No matter commercial or not. More important is open source criteria. > It seems like our conversation has gone a bit afield of the list's topic, > anyway. We continue discussion about HA for XC. > So have you found any way to do get write scalability and high availability? I've already wrote here about this. I think at this time the best solution is alliance of drbd + pacemaker + corosync. It is very effective, reliable and rather simple setup for duplicating every node. I have such architecture running lot of vz boxes. But it is no matter what services are there running. The difference is only resource agent, i.e. start/stop script. BTW among others (most of them are web, application and database servers) there are postgresql running inside those vz boxes without any problems. |
From: Roger M. <rog...@gm...> - 2012-11-05 15:31:27
|
On Tue, Oct 30, 2012 at 4:58 AM, Vladimir Stavrinov <vst...@gm...>wrote: > On Mon, Oct 29, 2012 at 01:59:47PM -0700, Roger Mayes wrote: > > > Restoring a virt from an image is one way of restoring from a > > backup. It's a bit quicker and more thorough, unless you have > > Normally You are restoring database from sql dump. If You want to do > this with data files, then You should synchronize all of them over the > cluster. > > > our hosting environment is limited. They're inexpensive enough for > > us because they use commodity hardware, but using commodity > > But cloud infrastructure with all management tools and service itself > costs money too. One of my providers offered me cloud instead of hardware > rent. But calculation showed it's twice more expensive for the same > capacity. Though it may be not a common case due to specific > requirements, but nevertheless I think cloud is not suitable for > cluster. Though I see the convenience for customers may overcome other > factors. > > > hardware means they can only give us so much cpu, ram, io, and > > network bandwidth on a single host. Hence the need for clustering. > > First, You loose performance with additional level for virtual machine > (though not so much). And second, You can't upgrade kernel running on > hardware host, leaving it on providers own. But this impacts not only > performance, but reliability too. Though I see it is not interesting for > You as You are getting that capacity what You are paying for. > > > Security became the victim of "speed" meaning system performance, > > or "speed" meaning expediency as far as getting it setup and > > First is right. > > > running goes? Nobody should ever run database processes as the > > root user. And they should never open direct database access ports > > to the outside world. > > You are absolutely right here. > > > The users themselves don't expect their posts to stay around forever. > > First, they prefer delete unneeded data them self, than loose what they > need. > Second, they can tolerate to loose old data, but just this data You can > restore > from backup. But they don't want to loose recent data, that they certainly > lose > at system crash. With lost data You can lose Your users, not only as > records in > database, that was dropped on crash, but existent users as persons, who > don't > want to use Your service any more, as well as new potential users, who will > never uses Your service. > > > As long as the downtime is not within the first few hours after > > Taylor makes her post, it's not a huge deal. > > But You can not plan the time of Your crash. More over under peak load > chance > to fail increased. And low probability of crash doesn't means it never > occur. > It happens at the "best" time when You don't expect. The Chernobyl disaster > occured as result of overlapping of five events, every of which was low > probability. > > > I guess we have HA in the sense that we can continue to operate if > > one of our load balanced front end web servers goes down, as long > > as it doesn't happen right when we're at peak load. But our > > There are no problems with HA for web servers as such. There are number of > different solutions. But we are talking here about database and it is a > quite different problem. > > > memcached clusters and database clusters have never yet been really > > set up to continue running if we were to lose a node. Although it > > Are You sleeping well? I was already scared. > > > would help us a lot of we could, because then we could handle less > > risk-tolerant, higher dollar ventures without having to get into > > dealing with Oracle (which creates a lot of risk by itself, because > > of the high costs involved). > > As I mentioned early, even RAC may crash. Besides, it have no write > scalability > (Your lovely commercial software). In my practice in the past I did some > fault > tolerant setup based on Oracle Data Guard technology, but I was satisfied > with > it. > > OK, all Your arguments make some sense. I agree, in Your example there may > be > some tolerance for data lost and down time, in some sense and to some > degree. > But this is Your reasoning only. Can You imagine crash of Your system in > reality? I would say, if You need scalability means You are running a big > system. With big system You most likely will suffer big losses in case of > disaster. > > P.S. Your last message did not arrived to mailing list. If it is not > mistake, I > will leave it untouched. But If You want, I can bounce both Your message > and my > response to mailing list as is, without modification. It is what my open > source > software can, but Your lovely commercial can not to do. That's pretty funny. That's a good point that there are some things open source software can do that no currently available well-known commercial software can. The lack of write scalability in Oracle is a big one. Odd that their list publishes our email addresses, and if I hit "reply" (in gmail), it goes to the person who made the post rather than to the list. That's not the way most other email lists work. I haven't used any commercial software for several years now, unless you count gmail. I guess gmail is more-or-less commercial software, isn't it? Lol. It seems like our conversation has gone a bit afield of the list's topic, anyway. With virtual cloud servers, we can rent a bunch of hardware for the space of a few hours, while most of the time renting such a small amount of hardware that the cost is almost nothing. No I don't sleep well, and I'd like to get out of my situation as soon as possible, but at least I have a little food on the table for the moment. So have you found any way to do get write scalability and high availability? I've not yet thoroughly investigated all of the NoSQL systems yet. > ----- > -- > > *************************** > ## Vladimir Stavrinov > ## vst...@gm... > *************************** > > |
From: Vladimir S. <vst...@gm...> - 2012-11-05 15:31:24
|
On Mon, Oct 29, 2012 at 01:59:47PM -0700, Roger Mayes wrote: > Restoring a virt from an image is one way of restoring from a > backup. It's a bit quicker and more thorough, unless you have Normally You are restoring database from sql dump. If You want to do this with data files, then You should synchronize all of them over the cluster. > our hosting environment is limited. They're inexpensive enough for > us because they use commodity hardware, but using commodity But cloud infrastructure with all management tools and service itself costs money too. One of my providers offered me cloud instead of hardware rent. But calculation showed it's twice more expensive for the same capacity. Though it may be not a common case due to specific requirements, but nevertheless I think cloud is not suitable for cluster. Though I see the convenience for customers may overcome other factors. > hardware means they can only give us so much cpu, ram, io, and > network bandwidth on a single host. Hence the need for clustering. First, You loose performance with additional level for virtual machine (though not so much). And second, You can't upgrade kernel running on hardware host, leaving it on providers own. But this impacts not only performance, but reliability too. Though I see it is not interesting for You as You are getting that capacity what You are paying for. > Security became the victim of "speed" meaning system performance, > or "speed" meaning expediency as far as getting it setup and First is right. > running goes? Nobody should ever run database processes as the > root user. And they should never open direct database access ports > to the outside world. You are absolutely right here. > The users themselves don't expect their posts to stay around forever. First, they prefer delete unneeded data them self, than loose what they need. Second, they can tolerate to loose old data, but just this data You can restore from backup. But they don't want to loose recent data, that they certainly lose at system crash. With lost data You can lose Your users, not only as records in database, that was dropped on crash, but existent users as persons, who don't want to use Your service any more, as well as new potential users, who will never uses Your service. > As long as the downtime is not within the first few hours after > Taylor makes her post, it's not a huge deal. But You can not plan the time of Your crash. More over under peak load chance to fail increased. And low probability of crash doesn't means it never occur. It happens at the "best" time when You don't expect. The Chernobyl disaster occured as result of overlapping of five events, every of which was low probability. > I guess we have HA in the sense that we can continue to operate if > one of our load balanced front end web servers goes down, as long > as it doesn't happen right when we're at peak load. But our There are no problems with HA for web servers as such. There are number of different solutions. But we are talking here about database and it is a quite different problem. > memcached clusters and database clusters have never yet been really > set up to continue running if we were to lose a node. Although it Are You sleeping well? I was already scared. > would help us a lot of we could, because then we could handle less > risk-tolerant, higher dollar ventures without having to get into > dealing with Oracle (which creates a lot of risk by itself, because > of the high costs involved). As I mentioned early, even RAC may crash. Besides, it have no write scalability (Your lovely commercial software). In my practice in the past I did some fault tolerant setup based on Oracle Data Guard technology, but I was satisfied with it. OK, all Your arguments make some sense. I agree, in Your example there may be some tolerance for data lost and down time, in some sense and to some degree. But this is Your reasoning only. Can You imagine crash of Your system in reality? I would say, if You need scalability means You are running a big system. With big system You most likely will suffer big losses in case of disaster. P.S. Your last message did not arrived to mailing list. If it is not mistake, I will leave it untouched. But If You want, I can bounce both Your message and my response to mailing list as is, without modification. It is what my open source software can, but Your lovely commercial can not to do. -- *************************** ## Vladimir Stavrinov ## vst...@gm... *************************** |
From: Roger M. <rog...@gm...> - 2012-11-05 15:31:22
|
On Mon, Oct 29, 2012 at 4:56 AM, Vladimir Stavrinov <vst...@gm...>wrote: > On Sun, Oct 28, 2012 at 5:35 AM, Shavais Zarathustra <sh...@gm...> > wrote: > > > Well, the point would be to get a replacement server going, for the > server > > that died, with all the software installed and the configuration set up, > > after which my hope has been that we'd be able to reinitialize the > database > > on that host and perform some kind of recovery process to get it back up > and > > working within the cluster. But maybe that requires some of the HA > features > > that you're talking about that XC doesn't have working yet? > > With HA there will no down time, so You will have enough time for > recovering failed node. Without HA You should recreate cluster from > scratch from backup. In both cases virtual machine helps not so much. > > Restoring a virt from an image is one way of restoring from a backup. It's a bit quicker and more thorough, unless you have something like Legato. > clustering stuff, together with Oracle's database clustering, which was > all > > I heard a story where whole bank was crashed on RAC. Even HA did not help. > > > of a brave new/old world for me, with all this poor man's Open Source > stuff, > > "poor man's" ? Great! > > There are a lot of sort of dirty gems scattered about the muddy, sea weed and jelly fish-litered beach of Open Source software, which with a bit of manual buffing, are quite beautiful in their particular ways - but that landscape is not to be compared with the pristine, opulent castles and treasure rooms of commercial software. But then, neither is the price tag. > > Well, the hardware they have at these pseudo-cloud datacenters is all > > What You are describing here and below is cloud infrastructure that > itself has scalability and HA, what cluster must have too. So what for > do You want one inside other? You loose efficiency and money. > > As I explained before, the scalability offered by a single host in our hosting environment is limited. They're inexpensive enough for us because they use commodity hardware, but using commodity hardware means they can only give us so much cpu, ram, io, and network bandwidth on a single host. Hence the need for clustering. > >> logs should be handled on every node, it is not so simple. > > > > Yeah, I was thinking this was probably the case. So what I'm not sure > of is > > what you do after your datanode has been recovered as far as you can get > it > > recovered using the usual single database recovery techniques - how do > you > > Without HA at this point down time started again. And if You succeed > in recovering at some point in time where this node will consistent > with cluster, then You will be happy, otherwise You will recreate Your > cluster from scratch from backup again. > > > Unix Admin "is only as good as their backups". That's certainly the > truth. > > No doubt, definitely! Backup always and everywhere. But with backup > You can recover Your system at some point in past. So you have both > joys: down time and data lost in this case too. Backup is not > alternative for HA and vice verse: we need them both. > > > But I'm not concerned about the security of my DBA role, in fact I've > been > > One developer boasted me how he can do database user becomes unix user > root and shuts down the system. The answer on my horror was something > similar what we are reading here: the security there becomes the > victim of speed. And it was very serious and responsible institution > where this database was running. Security became the victim of "speed" meaning system performance, or "speed" meaning expediency as far as getting it setup and running goes? Nobody should ever run database processes as the root user. And they should never open direct database access ports to the outside world. > > need a throat to cut before I can cut it. The risk of a crash is small > and > > tolerable, but if I'm not convinced I'll be able to handle the load - > that's > > a show stopper. > > If You > need cluster means You are doing something that require HA. What data You are processing that requires scalability? As I mentioned before - one example has been forum posts from Taylor Swift fans in reaction to Taylor Swift making a Facebook post or a Twitter Tweet. If we lose that data at some point in the future, it's not anywhere near as important as being able to handle a whole lot of people making and reading those posts all at once. In fact, we eventually delete the data our selves. That's just one example. > Is it garbage You willing to loose? The users themselves don't expect their posts to stay around forever. > What are those business processes that make Your > heavy load? Taylor Swift makes a Facebook post. 250 thousand Taylor Swift fans from all over the country immediately jump onto our system and start messaging each other, and posting videos, pictures, etc., and over the course of the next several hours, several million other people eventually find their way to the site. We collect analytics on all that traffic and make (general trend) reports to various commercial interests. > Are they nonsense that can tolerate down time? As long as the downtime is not within the first few hours after Taylor makes her post, it's not a huge deal. > Please > tell me, do You have cluster that running without HA? Or do you know > such? > > We have HA in the sense that apart from the backplanes, there's no single point of failure in our hardware setup. And, I guess we have HA in the sense that we can continue to operate if one of our load balanced front end web servers goes down, as long as it doesn't happen right when we're at peak load. But our memcached clusters and database clusters have never yet been really set up to continue running if we were to lose a node. Although it would help us a lot of we could, because then we could handle less risk-tolerant, higher dollar ventures without having to get into dealing with Oracle (which creates a lot of risk by itself, because of the high costs involved). |