postgres-xc-general Mailing List for Postgres-XC

Brought to you by: ahsanhadi, amitdkhan, ashutoshbapat, gabbasb, and 3 others

postgres-xc-general — General info and messages

You can subscribe to this list here.

2010	Jan	Feb	Mar	Apr	May (2)	Jun	Jul	Aug (6)	Sep	Oct (19)	Nov (1)	Dec
2011	Jan (12)	Feb (1)	Mar (4)	Apr (4)	May (32)	Jun (12)	Jul (11)	Aug (1)	Sep (6)	Oct (3)	Nov	Dec (10)
2012	Jan (11)	Feb (1)	Mar (3)	Apr (25)	May (53)	Jun (38)	Jul (103)	Aug (54)	Sep (31)	Oct (66)	Nov (77)	Dec (20)
2013	Jan (91)	Feb (86)	Mar (103)	Apr (107)	May (25)	Jun (37)	Jul (17)	Aug (59)	Sep (38)	Oct (78)	Nov (29)	Dec (15)
2014	Jan (23)	Feb (82)	Mar (118)	Apr (101)	May (103)	Jun (45)	Jul (6)	Aug (10)	Sep	Oct (32)	Nov	Dec (9)
2015	Jan (3)	Feb (5)	Mar	Apr (1)	May	Jun	Jul (9)	Aug (4)	Sep (3)	Oct	Nov	Dec
2016	Jan (3)	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2017	Jan	Feb	Mar	Apr	May	Jun (3)	Jul	Aug	Sep	Oct	Nov	Dec
2018	Jan	Feb	Mar	Apr	May (4)	Jun	Jul	Aug	Sep	Oct	Nov	Dec

Flat | Threaded

Re: [Postgres-xc-general] Our general use case

From: Vladimir S. <vst...@gm...> - 2012-11-05 16:26:48

On Mon, Nov 05, 2012 at 07:22:54PM +0300, Vladimir Stavrinov wrote:

> solution is alliance of drbd + pacemaker + corosync. It is  very

Forgot to add in this team ipvs.


*****************************
###  Vladimir Stavrinov
###  vst...@gm...
*****************************

Re: [Postgres-xc-general] Our general use case

From: Vladimir S. <vst...@gm...> - 2012-11-05 16:23:02

On Tue, Oct 30, 2012 at 10:02 PM, Roger Mayes <rog...@gm...> wrote:

> Odd that their list publishes our email addresses, and if I hit "reply" (in
> gmail), it goes to the person who made the post rather than to the list.

Click arrow on the right of "Reply" button, then click "Reply all". Or
copy/past list address into CC: field.

> gmail is more-or-less commercial software, isn't it?  Lol.

No matter commercial or not. More important is open source criteria.

> It seems like our conversation has gone a bit afield of the list's topic,
> anyway.

We continue discussion about HA for XC.

> So have you found any way to do get write scalability and high availability?

I've already wrote here about this. I think at this time the best
solution is alliance of drbd + pacemaker + corosync. It is  very
effective, reliable and rather simple setup for duplicating every
node. I have such architecture running lot of vz boxes. But it is no
matter what services are there running. The difference is only
resource agent, i.e. start/stop script. BTW among others (most of them
are web, application and database servers) there are postgresql
running inside those vz boxes without any problems.

Re: [Postgres-xc-general] Our general use case

From: Roger M. <rog...@gm...> - 2012-11-05 15:31:27

On Tue, Oct 30, 2012 at 4:58 AM, Vladimir Stavrinov <vst...@gm...>wrote:

> On Mon, Oct 29, 2012 at 01:59:47PM -0700, Roger Mayes wrote:
>
> >    Restoring a virt from an image is one way of restoring from a
> >    backup.  It's a bit quicker and more thorough, unless you have
>
> Normally You are restoring database from sql dump. If You want to do
> this with data files, then You should synchronize all of them over the
> cluster.
>
> >    our hosting environment is limited.  They're inexpensive enough for
> >    us because they use commodity hardware, but using commodity
>
> But cloud infrastructure with all management tools and service itself
> costs money too. One of my providers offered me cloud instead of hardware
> rent. But calculation showed it's twice more expensive for the same
> capacity. Though it may be not a common case due to specific
> requirements, but nevertheless I think cloud is not suitable for
> cluster. Though I see the convenience for customers may overcome other
> factors.
>
> >    hardware means they can only give us so much cpu, ram, io, and
> >    network bandwidth on a single host.  Hence the need for clustering.
>
> First, You loose performance with additional level for virtual machine
> (though not so much). And second, You can't upgrade kernel running on
> hardware host, leaving it on providers own. But this impacts not only
> performance, but reliability too. Though I see it is not interesting for
> You as You are getting that capacity what You are paying for.
>
> >    Security became the victim of "speed" meaning system performance,
> >    or "speed" meaning expediency as far as getting it setup and
>
> First is right.
>
> >    running goes?  Nobody should ever run database processes as the
> >    root user.  And they should never open direct database access ports
> >    to the outside world.
>
> You are absolutely right here.
>
> >    The users themselves don't expect their posts to stay around forever.
>
> First, they prefer delete unneeded data them self, than loose what they
> need.
> Second, they can tolerate to loose old data, but just this data You can
> restore
> from backup. But they don't want to loose recent data, that they certainly
> lose
> at system crash. With lost data You can lose Your users, not only as
> records in
> database, that was dropped on crash, but existent users as persons, who
> don't
> want to use Your service any more, as well as new potential users, who will
> never uses Your service.
>
> >    As long as the downtime is not within the first few hours after
> >    Taylor makes her post, it's not a huge deal.
>
> But You can not plan the time of Your crash. More over under peak load
> chance
> to fail increased. And low probability of crash doesn't means it never
> occur.
> It happens at the "best" time when You don't expect. The Chernobyl disaster
> occured as result of overlapping of five events, every of which was low
> probability.
>
> >    I guess we have HA in the sense that we can continue to operate if
> >    one of our load balanced front end web servers goes down, as long
> >    as it doesn't happen right when we're at peak load.  But our
>
> There are no problems with HA for web servers as such. There are number of
> different solutions. But we are talking here about database and it is a
> quite different problem.
>
> >    memcached clusters and database clusters have never yet been really
> >    set up to continue running if we were to lose a node.  Although it
>
> Are You sleeping well? I was already scared.
>
> >    would help us a lot of we could, because then we could handle less
> >    risk-tolerant, higher dollar ventures without having to get into
> >    dealing with Oracle (which creates a lot of risk by itself, because
> >    of the high costs involved).
>
> As I mentioned early, even RAC may crash. Besides, it have no write
> scalability
> (Your lovely commercial software). In my practice in the past I did some
> fault
> tolerant setup based on Oracle Data Guard technology, but I was satisfied
> with
> it.
>
> OK, all Your arguments make some sense. I agree, in Your example there may
> be
> some tolerance for data lost and down time, in some sense and to some
> degree.
> But this is Your reasoning only. Can You imagine crash of Your system in
> reality? I would say, if You need scalability means You are running a big
> system. With big system You most likely will suffer big losses in case of
> disaster.
>
> P.S. Your last message did not arrived to mailing list. If it is not
> mistake, I
> will leave it untouched. But If You want, I can bounce both Your message
> and my
> response to mailing list as is, without modification. It is what my open
> source
> software can, but Your lovely commercial can not to do.


That's pretty funny.  That's a good point that there are some things open
source software can do that no currently available well-known commercial
software can.  The lack of write scalability in Oracle is a big one.

Odd that their list publishes our email addresses, and if I hit "reply" (in
gmail), it goes to the person who made the post rather than to the list.
 That's not the way most other email lists work.  I haven't used any
commercial software for several years now, unless you count gmail.  I guess
gmail is more-or-less commercial software, isn't it?  Lol.

It seems like our conversation has gone a bit afield of the list's topic,
anyway.

With virtual cloud servers, we can rent a bunch of hardware for the space
of a few hours, while most of the time renting such a small amount of
hardware that the cost is almost nothing.

No I don't sleep well, and I'd like to get out of my situation as soon as
possible, but at least I have a little food on the table for the moment.

So have you found any way to do get write scalability and high
availability?   I've not yet thoroughly investigated all of the NoSQL
systems yet.




> -----
> --
>
> ***************************
> ##  Vladimir Stavrinov
> ##  vst...@gm...
> ***************************
>
>

Re: [Postgres-xc-general] Our general use case

From: Vladimir S. <vst...@gm...> - 2012-11-05 15:31:24

On Mon, Oct 29, 2012 at 01:59:47PM -0700, Roger Mayes wrote:

>    Restoring a virt from an image is one way of restoring from a
>    backup.  It's a bit quicker and more thorough, unless you have

Normally You are restoring database from sql dump. If You want to do
this with data files, then You should synchronize all of them over the
cluster.

>    our hosting environment is limited.  They're inexpensive enough for
>    us because they use commodity hardware, but using commodity

But cloud infrastructure with all management tools and service itself
costs money too. One of my providers offered me cloud instead of hardware
rent. But calculation showed it's twice more expensive for the same
capacity. Though it may be not a common case due to specific
requirements, but nevertheless I think cloud is not suitable for
cluster. Though I see the convenience for customers may overcome other
factors.

>    hardware means they can only give us so much cpu, ram, io, and
>    network bandwidth on a single host.  Hence the need for clustering.

First, You loose performance with additional level for virtual machine
(though not so much). And second, You can't upgrade kernel running on
hardware host, leaving it on providers own. But this impacts not only
performance, but reliability too. Though I see it is not interesting for
You as You are getting that capacity what You are paying for.

>    Security became the victim of "speed" meaning system performance,
>    or "speed" meaning expediency as far as getting it setup and

First is right.

>    running goes?  Nobody should ever run database processes as the
>    root user.  And they should never open direct database access ports
>    to the outside world.

You are absolutely right here.

>    The users themselves don't expect their posts to stay around forever.

First, they prefer delete unneeded data them self, than loose what they need.
Second, they can tolerate to loose old data, but just this data You can restore
from backup. But they don't want to loose recent data, that they certainly lose
at system crash. With lost data You can lose Your users, not only as records in
database, that was dropped on crash, but existent users as persons, who don't
want to use Your service any more, as well as new potential users, who will
never uses Your service.

>    As long as the downtime is not within the first few hours after
>    Taylor makes her post, it's not a huge deal.

But You can not plan the time of Your crash. More over under peak load chance
to fail increased. And low probability of crash doesn't means it never occur.
It happens at the "best" time when You don't expect. The Chernobyl disaster
occured as result of overlapping of five events, every of which was low
probability.

>    I guess we have HA in the sense that we can continue to operate if
>    one of our load balanced front end web servers goes down, as long
>    as it doesn't happen right when we're at peak load.  But our

There are no problems with HA for web servers as such. There are number of
different solutions. But we are talking here about database and it is a
quite different problem.

>    memcached clusters and database clusters have never yet been really
>    set up to continue running if we were to lose a node.  Although it

Are You sleeping well? I was already scared.

>    would help us a lot of we could, because then we could handle less
>    risk-tolerant, higher dollar ventures without having to get into
>    dealing with Oracle (which creates a lot of risk by itself, because
>    of the high costs involved).

As I mentioned early, even RAC may crash. Besides, it have no write scalability
(Your lovely commercial software). In my practice in the past I did some fault
tolerant setup based on Oracle Data Guard technology, but I was satisfied with
it.

OK, all Your arguments make some sense. I agree, in Your example there may be
some tolerance for data lost and down time, in some sense and to some degree.
But this is Your reasoning only. Can You imagine crash of Your system in
reality? I would say, if You need scalability means You are running a big
system. With big system You most likely will suffer big losses in case of
disaster.

P.S. Your last message did not arrived to mailing list. If it is not mistake, I
will leave it untouched. But If You want, I can bounce both Your message and my
response to mailing list as is, without modification. It is what my open source
software can, but Your lovely commercial can not to do.

-- 

***************************
##  Vladimir Stavrinov
##  vst...@gm...
***************************

Re: [Postgres-xc-general] Our general use case

From: Roger M. <rog...@gm...> - 2012-11-05 15:31:22

On Mon, Oct 29, 2012 at 4:56 AM, Vladimir Stavrinov <vst...@gm...>wrote:

> On Sun, Oct 28, 2012 at 5:35 AM, Shavais Zarathustra <sh...@gm...>
> wrote:
>
> > Well, the point would be to get a replacement server going, for the
> server
> > that died, with all the software installed and the configuration set up,
> > after which my hope has been that we'd be able to reinitialize the
> database
> > on that host and perform some kind of recovery process to get it back up
> and
> > working within the cluster.  But maybe that requires some of the HA
> features
> > that you're talking about that XC doesn't have working yet?
>
> With HA there will no down time, so You will have enough time for
> recovering failed node. Without HA You should recreate cluster from
> scratch from backup. In both cases virtual machine helps not so much.
>
>
Restoring a virt from an image is one way of restoring from a backup.  It's
a bit quicker and more thorough, unless you have something like Legato.

> clustering stuff, together with Oracle's database clustering, which was
> all
>
> I heard a story where whole bank was crashed on RAC. Even HA did not help.
>
> > of a brave new/old world for me, with all this poor man's Open Source
> stuff,
>
> "poor man's" ? Great!
>
>
There are a lot of sort of dirty gems scattered about the muddy, sea weed
and jelly fish-litered beach of Open Source software, which with a bit of
manual buffing, are quite beautiful in their particular ways - but that
landscape is not to be compared with the pristine, opulent castles and
treasure rooms of commercial software.  But then, neither is the price tag.


> > Well, the hardware they have at these pseudo-cloud datacenters is all
>
> What You are describing here and below is cloud infrastructure that
> itself has scalability and HA, what cluster must have too. So what for
> do You want one inside other? You loose efficiency and money.
>
>
As I explained before, the scalability offered by a single host in our
hosting environment is limited.  They're inexpensive enough for us because
they use commodity hardware, but using commodity hardware means they can
only give us so much cpu, ram, io, and network bandwidth on a single host.
 Hence the need for clustering.


> >> logs should be handled on every node, it is not so simple.
> >
> > Yeah, I was thinking this was probably the case.  So what I'm not sure
> of is
> > what you do after your datanode has been recovered as far as you can get
> it
> > recovered using the usual single database recovery techniques - how do
> you
>
> Without HA at this point down time started again. And if You succeed
> in recovering at some point in time where this node will consistent
> with cluster, then You will be happy, otherwise You will recreate Your
> cluster from scratch from backup again.
>
> > Unix Admin "is only as good as their backups".  That's certainly the
> truth.
>
> No doubt, definitely! Backup always and everywhere. But with backup
> You can recover Your system at some point in past. So you have both
> joys: down time and data lost in this case too. Backup is not
> alternative for HA and vice verse: we need them both.
>
> > But I'm not concerned about the security of my DBA role, in fact I've
> been
>
> One developer boasted me how he can do database user becomes unix user
> root and shuts down the system. The answer on my horror was something
> similar what we are reading here: the security there becomes the
> victim of speed. And it was very serious and responsible institution
> where this database was running.


Security became the victim of "speed" meaning system performance, or
"speed" meaning expediency as far as getting it setup and running goes?
 Nobody should ever run database processes as the root user.  And they
should never open direct database access ports to the outside world.


> > need a throat to cut before I can cut it.  The risk of a crash is small
> and
> > tolerable, but if I'm not convinced I'll be able to handle the load -
> that's
> > a show stopper.
>
> If You
> need cluster means You are doing something that require HA.

What data You are  processing that requires scalability?


As I mentioned before - one example has been forum posts from Taylor Swift
fans in reaction to Taylor Swift making a Facebook post or a Twitter Tweet.
 If we lose that data at some point in the future, it's not anywhere near
as important as being able to handle a whole lot of people making and
reading those posts all at once.  In fact, we eventually delete the data
our selves.  That's just one example.


> Is it garbage You willing to loose?


The users themselves don't expect their posts to stay around forever.


> What are those business processes that make Your
> heavy load?


Taylor Swift makes a Facebook post.  250 thousand Taylor Swift fans from
all over the country immediately jump onto our system and start messaging
each other, and posting videos, pictures, etc., and over the course of the
next several hours, several million other people eventually find their way
to the site.  We collect analytics on all that traffic and make (general
trend) reports to various commercial interests.


> Are they nonsense that can tolerate down time?


As long as the downtime is not within the first few hours after Taylor
makes her post, it's not a huge deal.


> Please
> tell me, do You have cluster that running without HA? Or do you know
> such?
>
>
We have HA in the sense that apart from the backplanes, there's no single
point of failure in our hardware setup.  And, I guess we have HA in the
sense that we can continue to operate if one of our load balanced front end
web servers goes down, as long as it doesn't happen right when we're at
peak load.  But our memcached clusters and database clusters have never yet
been really set up to continue running if we were to lose a node.

Although it would help us a lot of we could, because then we could handle
less risk-tolerant, higher dollar ventures without having to get into
dealing with Oracle (which creates a lot of risk by itself, because of the
high costs involved).

Flat | Threaded