postgres-xc-developers Mailing List for Postgres-XC

On Sat, Jul 14, 2012 at 1:01 AM, Nikhil Sontakke <ni...@st...> wrote:

> > But, one thing that is essential is catalog updates. Are you suggesting
> > that the catalog updates too should be done using some SQL?
>
> Surely, we can do:
>
> BEGIN;
>
> CREATE NODE; /* Hmm, I think we should have a CREATE CLUSTER NODE version*/
>
> ALL the redistribution SQL here;
>
> COMMIT;
>
> The new node info will be visible to the rest of the SQL commands that
> follow this way.
>
> Michael, I don't have a strong view against adding stuff in the core,
> but since the first cut for redistribution seems to be a bunch of SQL
> grouped together, I thought this might be worth investigating too.
>
Honestly, I considered such solutions before starting my work, but I
concluded that we need a native way to manage redistribution. With an
SQL-only approach, you run into the following limitations:
- How do you manage node-level granularity for redistribution operations?
In XC, by design, an SQL statement is launched globally on the cluster. The
exception is EXECUTE DIRECT, but it only allows SELECT commands.
- When and how do you manage the catalog update? Even if it is possible to
update the catalog with DML, we still need native APIs to modify catalog
entries.
- CREATE NODE is used for the addition and deletion of nodes, which is
cluster configuration. Data redistribution is not about changing the
configuration of the cluster. You have a certain number of Datanodes and
Coordinators, and you want to change the nodes where the data of a table is
located inside this given cluster. Your approach makes redistribution
dependent on cluster configuration, and honestly I would rather not add
that kind of dependency, which may become a burden if we once again change
the way the cluster is configured.
- For certain redistribution operations, we do NOT always need a
transaction block! Just take the example of a replicated table changed to a
distributed table. You only need to send a DELETE query to the remote nodes
so that each node removes the tuples that do not satisfy its hash condition.
This is one of the things I am working on now.
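
To illustrate the last point, here is a minimal sketch in Python of the
replicated-to-distributed case. It is not XC code: toy_hash, owner,
keys_to_delete and replicated_to_hash are hypothetical names, and the hash
is a stand-in rather than the one XC uses. Every node starts with a full
copy of the table, and each node only deletes the rows that do not hash to
it, so no data has to move between nodes.

```python
# Sketch only. All names are hypothetical and the hash is NOT Postgres-XC's.

def toy_hash(key: int) -> int:
    """Stand-in distribution hash (assumption for illustration)."""
    return (key * 2654435761) % (2 ** 32)


def owner(key: int, nodes: list) -> str:
    """Node that should hold `key` once the table is hash-distributed."""
    return nodes[toy_hash(key) % len(nodes)]


def keys_to_delete(node: str, local_keys: list, nodes: list) -> list:
    """Keys that `node` must DELETE: those that hash to another node."""
    return [k for k in local_keys if owner(k, nodes) != node]


def replicated_to_hash(keys: list, nodes: list) -> dict:
    """Start with a full copy of `keys` on every node, apply the DELETE."""
    remaining = {}
    for node in nodes:
        doomed = set(keys_to_delete(node, keys, nodes))
        remaining[node] = [k for k in keys if k not in doomed]
    return remaining


if __name__ == "__main__":
    layout = replicated_to_hash(list(range(10)), ["dn1", "dn2", "dn3"])
    for node, kept in layout.items():
        print(node, kept)
```

After the DELETE, each key survives on exactly one node, which is the
property a hash-distributed table needs.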

To my mind, each of these arguments alone makes this feature necessary in
core. Taken together, they make the case even stronger.
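
For reference, the node-level decisions for replicated tables discussed in
the quoted messages can be sketched roughly as follows, in Python. This is
only an illustration under assumptions: plan_replicated_redistribution is a
hypothetical name, commands are modelled as (command, nodes) pairs, reading
from the first old node is my choice for the sketch, and falling back to the
default command tree when nodes are both added and removed is an assumption,
not something the patch is stated to do.

```python
# Sketch only. Hypothetical helper, not the planner from the patch.

def plan_replicated_redistribution(old_nodes, new_nodes):
    """Return a list of (command, nodes) pairs for a replicated table."""
    old = sorted(set(old_nodes))
    new = sorted(set(new_nodes))
    if not old or not new:
        raise ValueError("both node lists must be non-empty")

    removed = [n for n in old if n not in new]
    added = [n for n in new if n not in old]

    if not removed and not added:
        return []  # same node set, nothing to do

    if removed and not added:
        # Node subset reduced: only the removed nodes are truncated.
        return [("TRUNCATE", removed)]

    if added and not removed:
        # Node subset increased: fetch the data once, load only new nodes.
        # Reading from old[0] is an assumption; any replica has all rows.
        return [("COPY TO", [old[0]]), ("COPY FROM", added)]

    # Mixed change (assumption): fall back to the default command tree.
    return [
        ("COPY TO", [old[0]]),
        ("TRUNCATE", old),
        ("COPY FROM", new),
        ("REINDEX", new),
    ]


if __name__ == "__main__":
    print(plan_replicated_redistribution(["dn1", "dn2", "dn3"], ["dn1", "dn2"]))
    print(plan_replicated_redistribution(["dn1"], ["dn1", "dn2"]))
```

The point of the sketch is that each command carries its own node list,
which is the node-level control that a globally launched SQL statement
cannot express.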

> Regards,
> Nikhils
>
> > Something I am afraid is not possible with an external utility is
> > control of redistribution at node level. For example, an external
> > contrib module or utility will launch SQL queries to XC that have to be
> > treated as global. However, redistribution needs to take care of cases
> > such as the reduction of nodes for replicated tables. In this case you
> > just need to delete the data from the removed nodes. Another easy example
> > is an increase of nodes for replicated tables. You need to pick up the
> > data on the coordinator and then send it only to the new nodes. Those
> > simple examples need management in core to minimize the work of
> > redistribution inside the cluster.
> >
> > On 2012/07/13, at 15:09, Ashutosh Bapat <ash...@en...
> >
> > wrote:
> >
> > Even, I am wondering if that would be better.
> >
> > But, one thing that is essential is catalog updates. Are you suggesting
> > that the catalog updates too should be done using some SQL?
> >
> > Can you please expand more on your idea, maybe by providing some
> > examples, pseudo-code etc.?
> >
> > On Fri, Jul 13, 2012 at 10:36 AM, Nikhil Sontakke <ni...@st...>
> > wrote:
> >>
> >> Just a thought.
> >>
> >> If we have a utility which spews out all of these statements to
> >> redistribute a table across node modifications, then we can just wrap
> >> them inside a transaction block and just run that?
> >>
> >> Won't it save all of the core changes?
> >>
> >> Regards,
> >> Nikhils
> >>
> >> On Fri, Jul 13, 2012 at 12:29 AM, Michael Paquier
> >> <mic...@gm...> wrote:
> >> > Hi all,
> >> >
> >> > Please find attached an updated patch adding redistribution
> >> > optimizations for replicated tables.
> >> > If the node subset of a replicated table is reduced, the removed nodes
> >> > are simply truncated.
> >> > If it is increased, a COPY TO is done to fetch the data, and COPY FROM
> >> > is done only on the new nodes.
> >> > New regression tests have been added to test that.
> >> >
> >> > Regards,
> >> >
> >> >
> >> > On Thu, Jul 12, 2012 at 5:30 PM, Michael Paquier
> >> > <mic...@gm...>
> >> > wrote:
> >> >>
> >> >> OK, here is the mammoth patch: 3000 lines including docs,
> >> >> implementation and regressions.
> >> >> The code has been realigned with current master.
> >> >> This patch introduces the latest thing I am working on: the
> >> >> redistribution command tree planning and execution.
> >> >>
> >> >> As I explained before, a redistribution consists of a series of
> >> >> commands (TRUNCATE, REINDEX, DELETE, COPY FROM, COPY TO) that need to
> >> >> be determined depending on the new and old locator information of the
> >> >> relation. Each action can be done on a subset of nodes.
> >> >> This patch introduces the basic infrastructure of the command tree
> >> >> build and execution.
> >> >> For the time being, redistribution uses only what is called the
> >> >> default command tree consisting of:
> >> >> 1) COPY TO
> >> >> 2) TRUNCATE
> >> >> 3) COPY FROM
> >> >> 4) REINDEX
> >> >> But this structure can easily be extended with more complicated
> >> >> operations.
> >> >> In this patch there is still a small thing missing, which is the
> >> >> possibility to launch a COPY FROM on a subset of nodes, particularly
> >> >> useful when redistribution involves a replicated table whose set of
> >> >> nodes is increased.
> >> >> Compared to the last versions, the impact of redistribution in
> >> >> tablecmds.c is limited.
> >> >>
> >> >> Regards,
> >> >>
> >> >> --
> >> >> Michael Paquier
> >> >> http://michael.otacoo.com
> >> >
> >> >
> >> >
> >> >
> >> > --
> >> > Michael Paquier
> >> > http://michael.otacoo.com
> >> >
> >> >
> >> >
> >> >
> >>
> >>
> >>
> >> --
> >> StormDB - http://www.stormdb.com
> >> The Database Cloud
> >
> >
> >
> >
> > --
> > Best Wishes,
> > Ashutosh Bapat
> > EnterpriseDB Corporation
> > The Enterprise Postgres Company
> >
>
>
>
> --
> StormDB - http://www.stormdb.com
> The Database Cloud
>

-- 
Michael Paquier
http://michael.otacoo.com
