Man Linux: Main Page and Category List

NAME

       table_dispatcher - PgQ consumer that is used to write source records
       into partitoned table.

SYNOPSIS

           table_dispatcher.py [switches] config.ini

DESCRIPTION

       table_dispatcher is PgQ consumer that reads url encoded records from
       source queue and writes them into partitioned tables according to
       configuration file. Used to partiton data. For example change logĀ“s
       that need to kept online only shortly can be written to daily tables
       and then dropped as they become irrelevant. Also allows to select which
       columns have to be written into target database Creates target tables
       according to configuration file as needed.

QUICK-START

       Basic table_dispatcher setup and usage can be summarized by the
       following steps:

        1.  PgQ must be installed in source database. See pgqadm man page for
           details. Target database must have pgq_ext schema installed.

        2.  edit a table_dispatcher configuration file, say
           table_dispatcher_sample.ini

        3.  create source queue

               $ pgqadm.py ticker.ini create <queue>

        4.  launch table dispatcher in daemon mode

               $ table_dispatcher.py table_dispatcher_sample.ini -d

        5.  start producing events

CONFIG

   Common configuration parameters
       job_name
           Name for particulat job the script does. Script will log under this
           name to logdb/logserver. The name is also used as default for PgQ
           consumer name. It should be unique.

       pidfile
           Location for pid file. If not given, script is disallowed to
           daemonize.

       logfile
           Location for log file.

       loop_delay
           If continuisly running process, how long to sleep after each work
           loop, in seconds. Default: 1.

       connection_lifetime
           Close and reconnect older database connections.

       use_skylog
           foo.

   Common PgQ consumer parameters
       pgq_queue_name
           Queue name to attach to. No default.

       pgq_consumer_id
           Consumers ID to use when registering. Default: %(job_name)s

   table_dispatcher parameters
       src_db
           Source database.

       dst_db
           Target database.

       dest_table
           Where to put data. when partitioning, will be used as base name

       part_field
           date field with will be used for partitioning.

       part_template
           SQL code used to create partition tables. Various magic
           replacements are done there:

           _PKEY           comma separated list of
                           primery key columns.
           _PARENT         schema-qualified parent
                           table name.
           _DEST_TABLE     schema-qualified partition
                           table.
           _SCHEMA_TABLE   same as DEST_TABLE but
                           dots replaced with "_", to
                           allow use as index names.

   Example config
           [table_dispatcher]
           job_name          = table_dispatcher_source_table_targetdb

           src_db            = dbname=sourcedb
           dst_db            = dbname=targetdb

           pgq_queue_name    = sourceq

           logfile           = log/%(job_name)s.log
           pidfile           = pid/%(job_name)s.pid

           # where to put data.  when partitioning, will be used as base name
           dest_table        = orders

           # names of the fields that must be read from source records
           fields            = id, order_date, customer_name

           # date field with will be used for partitioning
           part_field        = order_date

           # template used for creating partition tables
           part_template     =
                create table _DEST_TABLE () inherits (orders);
                alter table only _DEST_TABLE add constraint _DEST_TABLE_pkey primary key (id);
                grant select on _DEST_TABLE to group reporting;

COMMAND LINE SWITCHES

       Following switches are common to all skytools.DBScript-based Python
       programs.

       -h, --help
           show help message and exit

       -q, --quiet
           make program silent

       -v, --verbose
           make program more verbose

       -d, --daemon
           make program go background

       Following switches are used to control already running process. The
       pidfile is read from config then signal is sent to process id specified
       there.

       -r, --reload
           reload config (send SIGHUP)

       -s, --stop
           stop program safely (send SIGINT)

       -k, --kill
           kill program immidiately (send SIGTERM)

LOGUTRIGA EVENT FORMAT

       PgQ trigger function pgq.logutriga() sends table change event into
       queue in following format:

       ev_type

           (op || ":" || pkey_fields). Where op is either "I", "U" or "D",
           corresponging to insert, update or delete. And pkey_fields is
           comma-separated list of primary key fields for table. Operation
           type is always present but pkey_fields list can be empty, if table
           has no primary keys. Example: I:col1,col2

       ev_data
           Urlencoded record of data. It uses db-specific urlecoding where
           existence of = is meaningful - missing = means NULL, present =
           means literal value. Example: id=3&name=str&nullvalue&emptyvalue=

       ev_extra1
           Fully qualified table name.

                                  09/22/2008