postgres-ai
diff --git a/HELP.md b/HELP.md
Lines changed: 35 additions & 35 deletions
diff --git a/LICENSE b/LICENSE
Lines changed: 1 addition & 1 deletion
diff --git a/README.md b/README.md
Lines changed: 64 additions & 64 deletions
diff --git a/checkup b/checkup
Lines changed: 1 addition & 1 deletion
diff --git a/pghrep/LICENSE b/pghrep/LICENSE
@@ -7,20 +7,20 @@ This group determines the available resources such as hardware characteristics o
 General information about operational systems where the observed Postgres master and its replicas operate.
 
 > Insights:
-> 
+>
 > - Hardware and software differences (OS versions, Linux kernel versions, CPU, Memory). If the observed master and its replicas run on different platforms, it might cause issues with binary replication.
->  
+>
 > - Memory settings tuning. (Examples: is swap enabled? Are huge pages used?) Observing state of memory about memory consumption by database may lead to recommendations of changes to improve system performance.
->  
+>
 > - Information about virtualization type.
 
 
 ### A002 Postgres Version Information
 
 This report answers the following questions:
-- Do all nodes have the same Postgres version?  
+- Do all nodes have the same Postgres version?
 - Is the minor version being used up-to-date? Keeping the minor version of the database up-to-date is recommended to decrease chances to encounter with bugs, performance and security issues?
-- Is the major version currently supported by the community?  
+- Is the major version currently supported by the community?
 - Will the major version be supported by the community during the next 12 months?
 - If the minor version is not the most recent, are any critical bugfixes released that need to be applied ASAP?
 
@@ -40,26 +40,26 @@ The following is included:
 - The uptime. Sometimes low uptime may indicate an unplanned, accidental restart of the database.
 - General information: how many databases are on one instance, what is their size, replication mode, age of statistics.
 - Information about replicas, replication modes, replication delays.
-- Ratio of forced checkpoints among all checkpoints registered since statistics reset time. 
-> Insights: Frequent checkpoints in most cases create an excessive load on the disk subsystem. Identifying this fact will allow the more optimal disk utilization. 
-- How big is the observed database (the cluster may have multiple databases)? 
-> Insight: if the database is smaller than RAM, there are good chances to avoid intensive disk IO in most operations 
-- Cache Effectiveness: percentage of buffer pool hits. 
-> Insight: if it is not more than 95% on all nodes, it might be a good sign that the buffer pool size needs to be increased.   
-- Successful Commits: percentage of successfully committed transactions. 
-> Insight: if the value is not more than 99%, it might be a sign of logic issues with application code leading to high rates of ROLLBACK events. 
-- Temp Files per day: how many temporary files were generated per day in average, since last statistics reset time. 
-> Insight: if this value is high (thousands), it is a signal that work_mem should be increased. 
-- Deadlocks per day. 
-> Insight: significant (dozens) daily number of deadlocks is a sign of issues with application logic that needs redesign. 
+- Ratio of forced checkpoints among all checkpoints registered since statistics reset time.
+> Insights: Frequent checkpoints in most cases create an excessive load on the disk subsystem. Identifying this fact will allow the more optimal disk utilization.
+- How big is the observed database (the cluster may have multiple databases)?
+> Insight: if the database is smaller than RAM, there are good chances to avoid intensive disk IO in most operations
+- Cache Effectiveness: percentage of buffer pool hits.
+> Insight: if it is not more than 95% on all nodes, it might be a good sign that the buffer pool size needs to be increased.
+- Successful Commits: percentage of successfully committed transactions.
+> Insight: if the value is not more than 99%, it might be a sign of logic issues with application code leading to high rates of ROLLBACK events.
+- Temp Files per day: how many temporary files were generated per day in average, since last statistics reset time.
+> Insight: if this value is high (thousands), it is a signal that work_mem should be increased.
+- Deadlocks per day.
+> Insight: significant (dozens) daily number of deadlocks is a sign of issues with application logic that needs redesign.
 
 ### A005 Extensions
 
 Provides a list of all available and installed (in the current observed database) extensions, with versions. Insight: if there is a newer version of an installed extension, the report will highlight it, meaning that update is needed.
 
 ### A006 Postgres Setting Deviations
 
-Helps to check that there are no differences in Postgres configuration on the observed nodes (except `transaction_read_only` and pg_stat_kcache’s `linux_hz`). 
+Helps to check that there are no differences in Postgres configuration on the observed nodes (except `transaction_read_only` and pg_stat_kcache’s `linux_hz`).
 
 > Insights:
 > - In general, any differences in configuration on master and its replicas might lead to issues in case of failover. An example: the master is tuned, while replicas are not tuned at all or tuned poorly, in the event of failover, a new master cannot operate properly due to poor tuning.
@@ -72,13 +72,13 @@ There are multiple ways to change database settings globally:
 - explicitly, in the configuration file postgresql.conf, and
 - implicitly, using 'ALTER SYSTEM' commands.
 
-This report checks if there are settings which were set by implicit (ALTER SYSTEM) way.  
+This report checks if there are settings which were set by implicit (ALTER SYSTEM) way.
 
 Possible sources of configuration settings (presented in the first column of the report’s table):
 
 * `postgresql.auto.conf`: changed via 'ALTER SYSTEM' command.
 * `%any other file pattern%`: changed in additional config included to the main one.
-* `postgresql.conf`: non-default values are set in postgresql.conf.  
+* `postgresql.conf`: non-default values are set in postgresql.conf.
 
 ### A008 Disk Usage and File System Type
 
@@ -132,11 +132,11 @@ Shows global and per-table (if any) autovacuum-related Postgres settings.
 
 > Insights:
 > - Is any tuning applied (values are not default)?
-> - Are there any custom table autovacuum settings? There are cases when the tables have a custom autovacuum configuration. Tracking such tables will allow you to understand the nature of the functioning of autovacuum workers. Such tables are marked with asterisk (*) in the following reports.
+> - Are there any custom table autovacuum settings? There are cases when the tables have a custom autovacuum configuration. Tracking such tables will allow you to understand the nature of the functioning of autovacuum workers. Such tables are marked with asterisk (\*) in the following reports.
 
-### F002 Autovacuum: Transaction Wraparound Check
+### F002 Autovacuum: Transaction ID Wraparound Check
 
-Shows a distance in % to transaction wraparound disaster for every database.
+Shows a distance in % to transaction ID wraparound disaster for every database.
 
 > Insights:
 > If % is higher than 50%, autovacuum tuning should be considered as soon as possible.
@@ -160,14 +160,14 @@ Estimated table and index bloat is presented in this report.
 > - Objects with a high percentage of bloat lead to wasted disk space, degradation in query performance, additional CPU costs, and excessive read load on the disk.
 > This report is based on estimations. The errors in bloat estimates may be significant (in some cases, up to 15% and event more). Use it only as an indicator of potential issues.
 > - Checks the following things:
->     - Extreme (>90%) level of heap or index bloat estimated. 
->     - Significant (>40%) level of heap or index bloat estimated. 
+>     - Extreme (>90%) level of heap or index bloat estimated.
+>     - Significant (>40%) level of heap or index bloat estimated.
 
 ### F008 Autovacuum: Resource Usage
 
-Shows a table with Postgres settings related to autovacuum resource usage.  
+Shows a table with Postgres settings related to autovacuum resource usage.
 
-> Insights:  
+> Insights:
 > - Is  `autovacuum_max_workers`  not default? (When CPU cores or vCPUs >= 10).
 > - Is `autovacuum_vacuum_cost_limit` / `vacuum_cost_limit` not default?
 > - Isn't `maintenance_work_mem` / `autovacuum_work_mem` too low compared to table sizes and RAM?
@@ -201,7 +201,7 @@ A detailed snapshot report of all connections, grouped by users, databases and s
 Provides information about how "timeout" and locking-related settings are tuned, shows deadlocks counter for every database since statistics reset.
 
 > Insights:
-> - Questions worth answering:  
+> - Questions worth answering:
 >     - Is `statement_timeout` > 0 and <= 30 seconds (good choice for an OLTP system)?
 >     - Is `idle_in_transaction_session_timeout` >0 and < 20 minutes (preventing autovacuum and locking issues)?
 >     - Is `max_locks_per_transaction` not default (for example, low value may interrupt pg_dump)?
@@ -218,14 +218,14 @@ Shows the list of never used, rarely used and redundant indexes.
 Helps to understand how much space they occupy.
 
 > Insights:
-> - Questions worth answering:  
+> - Questions worth answering:
 >     - Is the total size of unused indexes less than 10% of the DB size (only if statistics is older than 1 week)?
 >     - Is statistics saved across restarts?
 > - If statistics age is low, the report should be used with caution.
 
 ### H003 Non-indexed Foreign Keys
 
-Checks if all foreign keys have indexes in referencing tables.  
+Checks if all foreign keys have indexes in referencing tables.
 
 # K. SQL Query Analysis
 
@@ -247,7 +247,7 @@ The grouping is based on the first word of every query.
 
 One of the most comprehensive and deep reports. Shows Top query groups
 ordered by total execution time during the observation period (`total_time` in
-pg_stat_statements). Good start for query optimization. 
+pg_stat_statements). Good start for query optimization.
 
 > Insights:
 > - The first question to answer: Are there any query groups with `total_time` ratio >50% of overall `total_time`?  If we have this type of query, it is definitely worth optimizing it.
@@ -262,11 +262,11 @@ face of a growing amount of data.
 
 ### L001 Table Sizes
 
-Displays the size of tables and their components (indexes, TOAST, the table itself).  
+Displays the size of tables and their components (indexes, TOAST, the table itself).
 
-> - Questions worth answering:  
->     - Does the size of indexes for each table not exceed heap (with toast) size? 
->     - Are there any non-indexes tables which size is > 10 MiB?  
+> - Questions worth answering:
+>     - Does the size of indexes for each table not exceed heap (with toast) size?
+>     - Are there any non-indexes tables which size is > 10 MiB?
 >     - Are there any non-partitioned tables of size > 100 GiB?
 
 ### L003 Integer (int2, int4) Out-of-range Risks in PKs
 
@@ -15,7 +15,7 @@ modification, are permitted provided that the following conditions are met:
 * Neither the name of the copyright holder nor the names of its
   contributors may be used to endorse or promote products derived from
   this software without specific prior written permission.
-  
+
 * Redistributions of any form whatsoever and integration into third-party
   products (including but not limited to cloud services) must retain the
   following acknowledgment in the documentation and copyright notices:
 
@@ -162,7 +162,8 @@ Which literally means: connect to the server with given credentials, save data i
 project directory, as epoch of check `1`. Epoch is a numerical (**integer**) sign of current iteration.
 For example: in half a year we can switch to "epoch number `2`".
 
-`-h db2.vpn.local` means: try to connect to host via SSH and then use remote `psql` command to perform checks.  
+`-h db2.vpn.local` means: try to connect to host via SSH and then use remote `psql` command to perform checks.
+
 If SSH is not available the local 'psql' will be used (non-psql reports will be skipped).
 
 For comprehensive analysis, it is recommended to run the tool on the master and
@@ -187,7 +188,7 @@ for host in db2.vpn.local db3.vpn.local db4.vpn.local; do
     -e 1 \
     --file resources/checks/K000_query_analysis.sh # the first snapshot is needed only for reports K***
 done
-  
+
 sleep "$DISTANCE"
 
 for host in db2.vpn.local db3.vpn.local db4.vpn.local; do
@@ -244,15 +245,14 @@ We need to know a hostname or an ip address of target database to be used with `
 PG_HOST=$(docker inspect --format '{{ .NetworkSettings.IPAddress }}' postgres)
 ```
 
-You can use official images or build an image yourself. 
-Run this command to build an image:
+You can use official images or build an image yourself. Run this command to build an image:
 
 ```bash
 docker build -t postgres-checkup .
 ```
 
-Then run a container with `postgres-checkup`. 
-This command run the tool using Postgres connection only (without SSH):
+Then run a container with `postgres-checkup`. This command run the tool using
+Postgres connection only (without SSH):
 
 ```bash
 docker run --rm \
@@ -323,89 +323,89 @@ Docker support implemented by [Ivan Muratov](https://gitlab.com/binakot).
 
 ## А. General  / Infrastructural
 
-- [x] A001 System, CPU, RAM, disks, virtualization #6 , #56 , #57 , #86 
+- [x] A001 System, CPU, RAM, disks, virtualization #6 , #56 , #57 , #86
 - [x] A002 PostgreSQL versions (Simple) #68, #21, #86
-- [x] A003 Collect pg_settings  #15, #167, #86 
-- [x] A004 General cluster info  #7, #58, #59, #86, #162  
-- [x] A005 Extensions #8, #60, #61, #86, #167   
-- [x] A006 Config diff  #9, #62, #63, #86  
-- [x] A007 ALTER SYSTEM vs postgresql.conf #18, #86  
-- [x] A008 Disk usage and file system type #19, #20  
-- [ ] A010 Data checksums, wal_log_hints #22  
-- [ ] A011 Connection pooling. pgbouncer #23  
-- [ ] A012 Anti-crash checks #177  
-
-## B. Backups and DR  
-
-- [ ] B001 SLO/SLA, RPO, RTO  #24  
-- [ ] B002 File system, mount flags #25  
-- [ ] B003 Full backups / incremental  #26  
-- [ ] B004 WAL archiving (GB/day?) - #27  
-- [ ] B005 Restore checks, monitoring, alerting  #28  
+- [x] A003 Collect pg_settings  #15, #167, #86
+- [x] A004 General cluster info  #7, #58, #59, #86, #162
+- [x] A005 Extensions #8, #60, #61, #86, #167
+- [x] A006 Config diff  #9, #62, #63, #86
+- [x] A007 ALTER SYSTEM vs postgresql.conf #18, #86
+- [x] A008 Disk usage and file system type #19, #20
+- [ ] A010 Data checksums, wal_log_hints #22
+- [ ] A011 Connection pooling. pgbouncer #23
+- [ ] A012 Anti-crash checks #177
+
+## B. Backups and DR
+
+- [ ] B001 SLO/SLA, RPO, RTO  #24
+- [ ] B002 File system, mount flags #25
+- [ ] B003 Full backups / incremental  #26
+- [ ] B004 WAL archiving (GB/day?) - #27
+- [ ] B005 Restore checks, monitoring, alerting  #28
 
 ## C. Replication and HA
 
-- [ ] C001 SLO/SLA  #29  
-- [ ] C002 Sync/async, Streaming / wal transfer; logical decoding #30  
-- [ ] C003 SPOFs; “-1 datacenter”, standby with traffic #31  
-- [ ] C004 Failover #32  
-- [ ] C005 Switchover #33  
-- [ ] C006 Delayed replica (replay of 1 day of WALs) - #34  
+- [ ] C001 SLO/SLA  #29
+- [ ] C002 Sync/async, Streaming / wal transfer; logical decoding #30
+- [ ] C003 SPOFs; “-1 datacenter”, standby with traffic #31
+- [ ] C004 Failover #32
+- [ ] C005 Switchover #33
+- [ ] C006 Delayed replica (replay of 1 day of WALs) - #34
 
-## D. Monitoring / Troubleshooting   
+## D. Monitoring / Troubleshooting
 
-- [ ] D001 Logging (syslog?), log_*** #35  
-- [x] D002 Useful Linux tools  #36  
-- [ ] D003 List of monitoring metrics #37  
-- [x] D004 pg_stat_statements, tuning opts, pg_stat_kcache #38  
-- [ ] D005 track_io_timing, …, auto_explain  #39  
-- [ ] D006 Recommended DBA toolsets: postgres_dba, pgCenter, pgHeroother  #40  
-- [ ] D007 Postgres-specific tools for troubleshooting  #137  
+- [ ] D001 Logging (syslog?), log_*** #35
+- [x] D002 Useful Linux tools  #36
+- [ ] D003 List of monitoring metrics #37
+- [x] D004 pg_stat_statements, tuning opts, pg_stat_kcache #38
+- [ ] D005 track_io_timing, …, auto_explain  #39
+- [ ] D006 Recommended DBA toolsets: postgres_dba, pgCenter, pgHeroother  #40
+- [ ] D007 Postgres-specific tools for troubleshooting  #137
 
 ## E. WAL, Checkpoints
 
-- [ ] E001 WAL/checkpoint settings, IO  #41   
-- [ ] E002 Checkpoints, bgwriter, IO  #42  
+- [ ] E001 WAL/checkpoint settings, IO  #41
+- [ ] E002 Checkpoints, bgwriter, IO  #42
 
 ## F. Autovacuum, Bloat
 
-- [x] F001 < F003 Current autovacuum-related settings  #108, #164    
-- [x] F002 < F007 Transaction wraparound check  #16, #171  
-- [x] F003 < F006 Dead tuples  #164   
-- [x] F004 < F001 Heap bloat estimation #87, #122  
-- [x] F005 < F002 Index bloat estimation #88  
-- [ ] F006 < F004 Precise heap bloat analysis 
-- [ ] F007 < F005 Precise index bloat analysis 
-- [x] F008 < F008 Resource usage (CPU, Memory, disk IO) #44  
+- [x] F001 < F003 Current autovacuum-related settings  #108, #164
+- [x] F002 < F007 Transaction ID wraparound check  #16, #171
+- [x] F003 < F006 Dead tuples  #164
+- [x] F004 < F001 Heap bloat estimation #87, #122
+- [x] F005 < F002 Index bloat estimation #88
+- [ ] F006 < F004 Precise heap bloat analysis
+- [ ] F007 < F005 Precise index bloat analysis
+- [x] F008 < F008 Resource usage (CPU, Memory, disk IO) #44
 
-## G. Performance / Connections / Memory-related Settings 
+## G. Performance / Connections / Memory-related Settings
 
-- [x] G001 Memory-related settings #45, #190  
-- [x] G002 Connections #46  
-- [x] G003 Timeouts, locks, deadlocks (amount) #47  
-- [ ] G004 Query planner (diff) #48   
-- [ ] G005 I/O settings #49   
-- [ ] G006 Default_statistics_target (plus per table?) #50   
+- [x] G001 Memory-related settings #45, #190
+- [x] G002 Connections #46
+- [x] G003 Timeouts, locks, deadlocks (amount) #47
+- [ ] G004 Query planner (diff) #48
+- [ ] G005 I/O settings #49
+- [ ] G006 Default_statistics_target (plus per table?) #50
 
 ## H. Index Analysis
 
-- [x] H001 Indexes: invalid #192, #51  
-- [x] H002 Unused and redundant indexes #51, #180, #170, #168, #322  
-- [x] H003 Missing FK indexes #52, #142, #173  
+- [x] H001 Indexes: invalid #192, #51
+- [x] H002 Unused and redundant indexes #51, #180, #170, #168, #322
+- [x] H003 Missing FK indexes #52, #142, #173
 
 ## J.  Capacity Planning
 
-- [ ] J001 Capacity planning - #54  
+- [ ] J001 Capacity planning - #54
 
 ## K. SQL query Analysis
 
-- [x] K001 Globally aggregated query metrics #158, #178, #182, #184  
-- [x] K002 Workload type ("first word" analysis) #159, #178, #179, #182, #184  
+- [x] K001 Globally aggregated query metrics #158, #178, #182, #184
+- [x] K002 Workload type ("first word" analysis) #159, #178, #179, #182, #184
 - [x] K003 Top queries by total_time  #160, #172, #174, #178, #179, #182, #184, #193
 
 ## L. DB Schema Analysis
-- [x] L001 (was: H003) Current sizes of DB objects (tables, indexes, mat. views)  #163  
-- [ ] L002 (was: H004) Data types being used #53  
+- [x] L001 (was: H003) Current sizes of DB objects (tables, indexes, mat. views)  #163
+- [ ] L002 (was: H004) Data types being used #53
 - [x] L003 Integer (int2, int4) out-of-range risks in PKs // calculate capacity remained; optional: predict when capacity will be fully used) https://gitlab.com/postgres-ai-team/postgres-checkup/issues/237
 
 ## TODO:
@@ -414,7 +414,7 @@ Docker support implemented by [Ivan Muratov](https://gitlab.com/binakot).
 
 ---
 
-# Ideas :bulb: :bulb: :bulb:  :thinking\_face: 
+# Ideas :bulb: :bulb: :bulb:  :thinking\_face:
 
 - analyze all FKs and check if data types of referencing column and referenced one match (same thing for multi-column FKs)
 - tables w/o PKs? tables not having even unique index?
 
@@ -1110,7 +1110,7 @@ run_checks() {
   done
 
   msg
-  msg "All checks has been finished for host '$HOST'!"
+  msg "All checks have been finished for host '$HOST'!"
 
   # print stacks with failed reports
   if ! [[ -z "${check_failed_json_stack}" ]]; then