Deepgreen & Greenplum DBA小白普及课之一(一般问题解答)

Posted

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Deepgreen & Greenplum DBA小白普及课之一(一般问题解答)相关的知识,希望对你有一定的参考价值。

原文链接

摘要: 不积跬步无以至千里,要想成为一名合格的数据库管理员,首先应该具备扎实的基础知识及问题处理能力。本文参考Pivotal官方FAQ,对一些在使用和管理Deepgreen & Greenplum时经常会遇到的普通问题进行讲解。

不积跬步无以至千里,要想成为一名合格的数据库管理员,首先应该具备扎实的基础知识及问题处理能力。本文参考Pivotal官方FAQ,对一些在使用和管理Deepgreen & Greenplum时经常会遇到的普通问题进行解答。希望对大家有所帮助,如果有朋友有更多的问题分享,请留言,我将一并整理。
 
下面单刀直入,开始问题浏览及解决思路梳理:
 
1.如何检查一张表的分区策略?
测试表:region
表的详细描述信息可以展示其分区策略:Distributed by: (r_regionkey)
tpch=# \d region
                             Append-Only Columnar Table "public.region"
   Column    |          Type          |                          Modifiers                           
-------------+------------------------+--------------------------------------------------------------
 r_regionkey | integer                | not null default nextval(‘region_r_regionkey_seq‘::regclass)
 r_name      | character(25)          |
 r_comment   | character varying(152) |
Checksum: t
Distributed by: (r_regionkey)
2.如何查看数据库中有多少用户模式?
在psql中使用\dn进行查看
tpch=# \dn
       List of schemas
        Name        |  Owner 
--------------------+---------
 gp_toolkit         | dgadmin
 information_schema | dgadmin
 pg_aoseg           | dgadmin
 pg_bitmapindex     | dgadmin
 pg_catalog         | dgadmin
 pg_toast           | dgadmin
 public             | dgadmin
(7 rows)
3.如何查看我的表上一次ANALYZE的时间?
测试表:region
通过pg_stat_operations视图可以查看任何对象的所有操作
tpch=# select objname,actionname,statime from pg_stat_operations  where objname like ‘region‘;
 objname | actionname |            statime           
---------+------------+-------------------------------
 region  | CREATE     | 2017-05-21 00:32:28.672208+08
 region  | ANALYZE    | 2017-06-30 06:55:57.658525+08
(2 rows)
4.如何查看表的大小?
测试表:customer,在模式public下
查看表的大小:
tpch=# select pg_size_pretty(pg_relation_size(‘public.customer‘));
 pg_size_pretty
----------------
 122 MB
(1 row)
查看表和索引的大小:
tpch=# select pg_size_pretty(pg_total_relation_size(‘public.customer‘));
 pg_size_pretty
----------------
 155 MB
(1 row)
5.如何查看模式(schema)大小?
测试模式:public,查询时只需将下面where条件schemaname=后的public替换成你要查询的schema名称即可。
tpch=# select schemaname ,round(sum(pg_total_relation_size(schemaname||‘.‘||tablename))/1024/1024) "Size_MB" from pg_tables where schemaname=‘public‘ group by 1;
 schemaname | Size_MB
------------+---------
 public     |   10698
(1 row)
6.如何查看数据库大小?
测试数据库:tpch,你查询时只需要替换括号里的tpch为你需要的数据库即可
tpch=# select pg_size_pretty(pg_database_size(‘tpch‘));
 pg_size_pretty
----------------
 10 GB
(1 row)
查看所有数据库的大小:
tpch=# select datname,pg_size_pretty(pg_database_size(datname)) from pg_database;
  datname  | pg_size_pretty
-----------+----------------
 tpch      | 10 GB
 postgres  | 111 MB
 tpch_1g   | 1100 MB
 template1 | 29 MB
 template0 | 27 MB
(5 rows)
7.如何查看分区表大小(包括分区和索引)?
测试表:employee_daily
select schemaname,tablename,round(sum(pg_total_relation_size(schemaname "" ‘.‘ "" partitiontablename))/1024/1024) "MB"
from pg_partitions where tablename=‘employee_daily‘ group by 1,2;
schemaname " tablename      " MB
-----------+----------------+-----
public     " employee_daily " 254
8.如何查看修改表的语法帮助?
在psql中使用【\h +待查语法】:
tpch=# \h alter table
Command:     ALTER TABLE
Description: change the definition of a table
Syntax:
ALTER TABLE [ONLY] name RENAME [COLUMN] column TO new_column

ALTER TABLE name RENAME TO new_name

ALTER TABLE name SET SCHEMA new_schema

ALTER TABLE [ONLY] name SET
     DISTRIBUTED BY (column, [ ... ] )
   | DISTRIBUTED RANDOMLY
   | WITH (REORGANIZE=true|false)

ALTER TABLE [ONLY] name action [, ... ]

ALTER TABLE name
   [ ALTER PARTITION { partition_name | FOR (RANK(number))
                     | FOR (value) } partition_action [...] ]
   partition_action

where action is one of:
  ADD [COLUMN] column_name type
      [ ENCODING ( storage_directive [,...] ) ]
      [column_constraint [ ... ]]
  DROP [COLUMN] column [RESTRICT | CASCADE]
  ALTER [COLUMN] column TYPE type [USING expression]
  ALTER [COLUMN] column SET DEFAULT expression
  ALTER [COLUMN] column DROP DEFAULT
  ALTER [COLUMN] column { SET | DROP } NOT NULL
  ALTER [COLUMN] column SET STATISTICS integer
  ADD table_constraint
  DROP CONSTRAINT constraint_name [RESTRICT | CASCADE]
  DISABLE TRIGGER [trigger_name | ALL | USER]


tpch=# \h create index
Command:     CREATE INDEX
Description: define a new index
Syntax:
CREATE [UNIQUE] INDEX name ON table
       [USING btree|bitmap|gist]
       ( {column | (expression)} [opclass] [, ...] )
       [ WITH ( FILLFACTOR = value ) ]
       [TABLESPACE tablespace]
       [WHERE predicate]
9.如何从Master主机连接到Master或者Segment实例的系统模式?
[email protected]:~$ PGOPTIONS=‘-c gp_session_role=utility‘ psql -p 25432 -h flash -d postgres
psql (8.2.15)
Type "help" for help.
postgres=# \q
其中-p后面接master或者segment的端口号,-h后面接对应的master或者segment主机名,-d后面接数据库名
 
10.从哪里看数据库日志信息?
Master主机:
Master节点的数据库日志存在$MASTER_DATA_DIRECTORY/pg_log/目录下,文件名根据数据库的log_filename参数生成。
[email protected]:~$ gpconfig -s log_filename
Values on all segments are consistent
GUC          : log_filename
Master  value: gpdb-%Y-%m-%d_%H%M%S.csv
Segment value: gpdb-%Y-%m-%d_%H%M%S.csv
     默认安装的日志文件格式为:$MASTER_DATA_DIRECTORY/pg_log/gpdb-yyyy-mm-dd_000000.csv
     gpstart\gpstop\gpstate和另外utility生成的日志存储在:~gpadmin/gpAdminLogs/目录下
Segment主机:
     Primary Segment日志位置用下面语句查询:select dbid,hostname,datadir||‘/pg_log‘ from gp_configuration where content not in (-1) and isprimary is true;
     Mirror Segment日志位置用下面语句查询:Primary Segment日志位置用下面语句查询:select dbid,hostname,datadir||‘/pg_log‘ from gp_configuration where content not in (-1) and isprimary is false;
11.如何查看函数列表?
在psql界面执行:\df schemaname.functionname (schemaname and function name support wildcard characters)
例如:
tpch=# \df pub*.*test*
                       List of functions
 Schema "   Name      " Result data type " Argument data types
--------+-------------+------------------+---------------------
 public " bugtest     " integer          "
 public " test        " boolean          " integer
 public " test        " void             "
(3 rows)
12.如何检查数据库运行状态是否正常?
gpstate是数据库状态检查的一个组件,可以使用gpstate -Q快速查看数据库状态。
[email protected]:/dgdata/primary/dg0/pg_log$ gpstate -Q
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-Starting gpstate with args: -Q
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-local Greenplum Version: ‘postgres (Greenplum Database) 4.3.99.00 build Deepgreen DB‘
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-master Greenplum Version: ‘PostgreSQL 8.2.15 (Greenplum Database 4.3.99.00 build Deepgreen DB) on x86_64-unknown-linux-gnu, compiled by GCC gcc (Ubuntu 4.9.4-2ubuntu1~14.04.1) 4.9.4 compiled on May 18 2017 05:19:19‘
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-Obtaining Segment details from master...
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:--Quick Greenplum database status from Master instance only
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-----------------------------------------------------------
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-# of up segments, from configuration table     = 4
20170701:07:09:37:027224 gpstate:flash:dgadmin-[INFO]:-----------------------------------------------------------
13.如何查看数据库总列表?
在psql界面执行\l (英文字母L的小写):
postgres=# \l
                  List of databases
   Name    |  Owner  | Encoding |  Access privileges 
-----------+---------+----------+---------------------
 postgres  | dgadmin | UTF8     |
 template0 | dgadmin | UTF8     | =c/dgadmin         
                                : dgadmin=CTc/dgadmin
 template1 | dgadmin | UTF8     | =c/dgadmin         
                                : dgadmin=CTc/dgadmin
 tpch      | dgadmin | UTF8     |
 tpch_1g   | dgadmin | UTF8     |
(5 rows)
也可以通过下面sql查询更详细的信息:
postgres=# select * from pg_database;
  datname  | datdba | encoding | datistemplate | datallowconn | datconnlimit | datlastsysoid | datfrozenxid | dattablespace | datconfig |              datacl             
-----------+--------+----------+---------------+--------------+--------------+---------------+--------------+---------------+-----------+----------------------------------
 tpch      |     10 |        6 | f             | t            |           -1 |         10898 |          888 |          1663 |           |
 postgres  |     10 |        6 | t             | t            |           -1 |         10898 |          888 |          1663 |           |
 tpch_1g   |     10 |        6 | f             | t            |           -1 |         10898 |          888 |          1663 |           |
 template1 |     10 |        6 | t             | t            |           -1 |         10898 |          888 |          1663 |           | {=c/dgadmin,dgadmin=CTc/dgadmin}
 template0 |     10 |        6 | t             | f            |           -1 |         10898 |          888 |          1663 |           | {=c/dgadmin,dgadmin=CTc/dgadmin}
(5 rows)
14.如何查看当前你的连接用户是谁?
技术分享

以上是关于Deepgreen & Greenplum DBA小白普及课之一(一般问题解答)的主要内容,如果未能解决你的问题,请参考以下文章

Deepgreen DB简介(转)

Greenplum 架构详解 & Hash Join 算法介绍

升级 greenplum-db

ADBPG&Greenplum成本优化之磁盘水位管理

升级 greenplum-cc-web 到 4.4.0

GreenPlum - 数据库锁 & 连接事务排查命令