在日志表上按顺序使用多个连接优化 MySQL 查询以找到最大差异（最佳记录）

Posted 2023-03-31

技术标签:

【中文标题】在日志表上按顺序使用多个连接优化 MySQL 查询以找到最大差异（最佳记录）【英文标题】：Optimize MySQL query using multiple joins by order on logging table to find greatest diff (best records) 【发布时间】：2020-06-04 09:20:52 【问题描述】：

假设我们有一个表格来记录用户的体重和其他信息，如下所示：

health_indexes

id | user_id | weight | created_at
---+---------+--------+-----------
1  | 50      | 100    | 2020-01-01
2  | 50      | 98     | 2020-01-05
3  | 50      | 98.5   | 2020-01-10
4  | 50      | 92     | 2020-01-15
5  | 50      | 80     | 2020-01-20
.
.
.
10 | 100     | 130    | 2018-01-01
11 | 100     | 149999 | 2018-01-05
12 | 100     | 159999 | 2018-01-10
13 | 100     | 120    | 2018-01-15
.
.
.
20 | 200     | 87     | 2020-02-01
.
.
.
30 | 300     | 140    | 2020-01-01

我确实得到了下表，但我正在寻找更好的方法：

user_id | first_weight | first_created_at | last_weight | last_created_at
--------+--------------+------------------+-------------+----------------
50      | 100          | 2020-01-01       | 80          | 2020-01-20
100     | 130          | 2018-01-01       | 120         | 2018-01-15

查询：

select u.id user_id,
    (select weight from health_indexes where user_id = u.id order by created_at limit 1) first_weight,
    (select created_at from health_indexes where user_id = u.id order by created_at limit 1) first_created_at,
    (select weight from health_indexes where user_id = u.id order by created_at desc limit 1) last_weight,
    (select created_at from health_indexes where user_id = u.id order by created_at desc limit 1) last_created_at
from users u
group by u.id
having first_weight > last_weight
order by (first_weight - last_weight) desc
limit 50;

我正在寻找一种在 health_indexes 上加入两次以获得相同结果的方法。有什么想法吗？

【问题讨论】：

【参考方案1】：

如果您使用的是 mysql 8.0，则可以仅使用窗口函数执行此操作，无需任何连接。 distinct 可以与窗口函数结合使用是一种罕见的情况：

select distinct
    user_id,
    first_value(weight) over(partition by user_id order by created_at)       first_weight,
    min(created_at)     over(partition by user_id)                           first_created_at,
    first_value(weight) over(partition by user_id order by created_at desc)  last_weight,
    max(created_at)     over(partition by user_id)                           last_created_at
from health_indexes

在早期版本中，一个选项使用连接和过滤：

select 
    hi1.user_id,
    hi1.weight first_weight,
    hi1.created_at first_created_at,
    hi2.weight last_weight,
    hi2.created_at last_created_at
from health_indexes hi1
inner join health_indexes hi2 on hi2.user_id = hi1.user_id
where
        hi1.created_at = (select min(h3.created_at) from health_indexes hi3 where hi3.user_id = hi1.user_id)
    and hi2.created_at = (select max(h3.created_at) from health_indexes hi3 where hi3.user_id = hi2.user_id)

【讨论】：

嗯...要学习的新东西.... :thinking:, 我会试试你的解决方案，谢谢@GMB ERROR 1064 (42000)：您的 SQL 语法有错误；检查与您的 MySQL 服务器版本相对应的手册，以在第 3 行的 '(partition by user_id order by created_at) first_weight, min(created_a' 附近使用正确的语法 @MohammadNaji：这表示你的sql版本的解决方案更新了我的答案

以上是关于在日志表上按顺序使用多个连接优化 MySQL 查询以找到最大差异（最佳记录）的主要内容，如果未能解决你的问题，请参考以下文章

具有多个表的联结表上的 MySQL SELECT 查询