SQL 选择表中的某些行,以便它们总和为某个值

Posted

技术标签:

【中文标题】SQL 选择表中的某些行,以便它们总和为某个值【英文标题】:SQL select some rows in a table so that they sum up to certain value 【发布时间】:2012-03-20 12:11:43 【问题描述】:

我怎样才能只选择下表中的一些行,以便它们总和为某个值?

Table
-----
id | qty1 | qty2 | qty3 | qty4
------------------------------
1  | 0.0  | 0.0  | 10   | 20
2  | 1.5  | 0.0  | 7.5  | 18
3  | 1.0  | 2.0  | 7.5  | 18
4  | 0.0  | 0.5  | 5    | 13

比方说,我想要的最高值是 57...

所以我需要从上一个表中选择行,使得每行的 qty1+qty2+qty3+qty4,直到 57 值,然后丢弃其他行。在这个例子中,我会得到以下信息:

id | qty1 | qty2 | qty3 | qty4
------------------------------
1  | 0.0  | 0.0  | 10   | 20
2  | 1.5  | 0.0  | 7.5  | 18

因为 10+20+1.5+7.5+18 = 57,所以我舍弃了第 3 行和第 4 行...

现在我希望最高值为 50,那么我应该得到:

id | qty1 | qty2 | qty3 | qty4
------------------------------
1  | 0.0  | 0.0  | 10   | 20
2  | 1.5  | 0.0  | 7.5  | 11

由于这些值的总和为 50,并且 row2,qty4 中的 7 被忽略... (顺便说一句,行以这种特殊方式排序,因为这是我希望计算 qtys 总和的顺序......例如,先总结第一行,然后是 3,然后是 2,然后是 4,这是无效的......它们应始终按 1、2、3、4 的顺序相加...)

如果我想要这个的补充怎么办?我的意思是,我在最后一个结果中没有得到的另外两行。

第一种情况:

id | qty1 | qty2 | qty3 | qty4
------------------------------
3  | 1.0  | 2.0  | 7.5  | 18
4  | 0.0  | 0.5  | 5    | 13

第二种情况:

id | qty1 | qty2 | qty3 | qty4
------------------------------
2  | 0.0  | 0.0  | 0.0  | 7
3  | 1.0  | 2.0  | 7.5  | 18
4  | 0.0  | 0.5  | 5    | 13

(如果第二种情况太复杂,获取如何:

id | qty1 | qty2 | qty3 | qty4
------------------------------
1  | 0.0  | 0.0  | 10   | 20

因为将第 2 行的原始数量加起来会超过 50 的值,所以我将其丢弃... 这种情况下的补码应该是:

id | qty1 | qty2 | qty3 | qty4
------------------------------
2  | 1.5  | 0.0  | 7.5  | 18
3  | 1.0  | 2.0  | 7.5  | 18
4  | 0.0  | 0.5  | 5    | 13

)

【问题讨论】:

我写了很多复杂的查询,我喜欢接受挑战,但这是极少数情况之一,只需要你用你选择的语言编写程序代码。 即使是简化的第二种情况?文章末尾括号中的最后一个...? 简化的第二种情况可用作查询。如果您修改您的问题以仅要求该问题(或创建一个仅要求该问题的新问题),我可以提供帮助。 fo 300 我将不得不向您解释我所面临的实际问题的每一个细节,而不是在这里仅展示抽象的特定细节作为示例,因此您提供的解决方案不仅仅是概括我必须翻译或改编,但我需要解决我的问题的实际解决方案。我会让你签署一份保密协议,因为这些细节是我工作中的事情,不应该为任何人所知...... ;) 【参考方案1】:

括号里的简化选项还不错:

SELECT foo1.*
  FROM foo AS foo1
  JOIN foo AS foo2
    ON foo2.id <= foo1.id
 GROUP
    BY foo1.id
HAVING SUM(foo2.qty1 + foo2.qty2 + foo2.qty3 + foo2.qty4) <= 57
;

(你没有提到表的名称,所以我选择了foo。)

补语是:

SELECT *
  FROM foo
 WHERE id NOT IN
        ( SELECT foo1.id
            FROM foo AS foo1
            JOIN foo AS foo2
              ON foo2.id <= foo1.id
           GROUP
              BY foo1.id
          HAVING SUM(foo2.qty1 + foo2.qty2 + foo2.qty3 + foo2.qty4) <= 57
        )
;

不带括号的选项要复杂得多;这是可行的,但你最好使用stored procedure。

【讨论】:

【参考方案2】:

让我们这样说吧:如果 SQL 是一种宗教,我会因为提供这个解决方案而下地狱。 SQL 并不是为了解决这类问题,所以任何解决方案都会很糟糕。我的也不例外:)

set @limitValue := 50;
select id, newQty1, newQty2, newQty3, newQty4 from (
  select id,
  if(@limitValue - qty1 > 0, qty1, greatest(@limitValue, 0)) newQty1,
  @limitValue := @limitValue - qty1 Total1,
  if(@limitValue - qty2 > 0, qty2, greatest(@limitValue, 0)) newQty2,
  @limitValue := @limitValue - qty2 Total2,
  if(@limitValue - qty3 > 0, qty3, greatest(@limitValue, 0)) newQty3,
  @limitValue := @limitValue - qty3 Total3,
  if(@limitValue - qty4 > 0, qty4, greatest(@limitValue, 0)) newQty4,
  @limitValue := @limitValue - qty4 Total4
  from (
    select id, qty1, qty2, qty3, qty4,
      @rowTotal < @limitValue Useful,
      @previousRowTotal := @rowTotal PreviousRowTotal,
      @rowTotal := @rowTotal + qty1 + qty2 + qty3 + qty4 AllRowsTotal,
      @rowTotal - @previousRowTotal CurrentRowTotal 
    from t,
    (select @rowTotal := 0, @previousRowTotal := 0) S1
  ) MarkedUseful
  where useful = 1
) Final

对于提供的数据,这会导致:

+----+---------+---------+---------+---------+
| ID | NEWQTY1 | NEWQTY2 | NEWQTY3 | NEWQTY4 |
+----+---------+---------+---------+---------+
|  1 | 0       |       0 | 10      |      20 |
|  2 | 1.5     |       0 | 7.5     |      11 |
+----+---------+---------+---------+---------+

还有补语:

set @limitValue := 50;
select t1.id,
  coalesce(t1.qty1 - newQty1, t1.qty1) newQty1,
  coalesce(t1.qty2 - newQty2, t1.qty2) newQty2,
  coalesce(t1.qty3 - newQty3, t1.qty3) newQty3,
  coalesce(t1.qty4 - newQty4, t1.qty4) newQty4
from t t1 left join (
    select id,
    if(@limitValue - qty1 > 0, qty1, greatest(@limitValue, 0)) newQty1,
    @limitValue := @limitValue - qty1 Total1,
    if(@limitValue - qty2 > 0, qty2, greatest(@limitValue, 0)) newQty2,
    @limitValue := @limitValue - qty2 Total2,
    if(@limitValue - qty3 > 0, qty3, greatest(@limitValue, 0)) newQty3,
    @limitValue := @limitValue - qty3 Total3,
    if(@limitValue - qty4 > 0, qty4, greatest(@limitValue, 0)) newQty4,
    @limitValue := @limitValue - qty4 Total4
    from (
      select id, qty1, qty2, qty3, qty4,
        @rowTotal < @limitValue Useful,
        @previousRowTotal := @rowTotal PreviousRowTotal,
        @rowTotal := @rowTotal + qty1 + qty2 + qty3 + qty4 AllRowsTotal,
        @rowTotal - @previousRowTotal CurrentRowTotal 
      from t,
      (select @rowTotal := 0, @previousRowTotal := 0) S1
    ) MarkedUseful
    where useful = 1
) Final
on t1.id = final.id
where Total1 < 0 or Total2 < 0 or Total3 < 0 or Total4 < 0 or final.id is null

对于提供的数据,这会导致:

+----+---------+---------+---------+---------+
| ID | NEWQTY1 | NEWQTY2 | NEWQTY3 | NEWQTY4 |
+----+---------+---------+---------+---------+
|  2 |       0 | 0       | 0       |       7 |
|  3 |       1 | 2       | 7.5     |      18 |
|  4 |       0 | 0.5     | 5       |      13 |
+----+---------+---------+---------+---------+

享受吧!

【讨论】:

刚刚尝试了提供的解决方案,但是对于补充情况,使用相同的示例数据,我没有得到您显示的结果,而是数据表中的原始 4 行... 现在可以了!我不知道我上次做错了什么:)让我测试一下并将其应用到我的真实案例中,我会给你一个关于赏金的答案;)【参考方案3】:

您应该只调整init 子查询中的@limit 变量初始化。第一个查询输出数据达到限制,第二个查询输出其补码。

SELECT
  id,
  @qty1 as qty1,
  @qty2 as qty2,
  @qty3 as qty3,
  @qty4 as qty4
FROM quantities q,
  (SELECT @qty1:=0.0, @qty2:=0.0,
          @qty3:=0.0, @qty4:=0.0,
          @limit:=50.0) init
WHERE
  IF(@limit > 0,
     GREATEST(1,
       IF(@limit-qty1 >=0,
          @limit:=(@limit-(@qty1:=qty1)),
          @qty1:=@limit + LEAST(@limit, @limit:=0)),
       IF(@limit-qty2 >=0,
          @limit:=(@limit-(@qty2:=qty2)),
          @qty2:=@limit + LEAST(@limit, @limit:=0)),
       IF(@limit-qty3 >=0,
          @limit:=(@limit-(@qty3:=qty3)),
          @qty3:=@limit + LEAST(@limit, @limit:=0)),
       IF(@limit-qty4 >=0,
          @limit:=(@limit-(@qty4:=qty4)),
          @qty4:=@limit + LEAST(@limit, @limit:=0))),0)
;

补语:

SELECT
  id,
  IF(qty1=@qty1, qty1, qty1-@qty1) as qty1,
  IF(qty2=@qty2, qty2, qty2-@qty2) as qty2,
  IF(qty3=@qty3, qty3, qty3-@qty3) as qty3,
  IF(qty4=@qty4, qty4, qty4-@qty4) as qty4
FROM quantities q,
  (SELECT @qty1:=0.0, @qty2:=0.0,
          @qty3:=0.0, @qty4:=0.0,
          @limit:=50.0) init
WHERE
  IF(
    LEAST(
      IF(@limit-qty1 >=0,
         @limit:=(@limit-(@qty1:=qty1)),
         @qty1:=@limit + LEAST(@limit, @limit:=0)),
      IF(@limit-qty2 >=0,
         @limit:=(@limit-(@qty2:=qty2)),
         @qty2:=@limit + LEAST(@limit, @limit:=0)),
      IF(@limit-qty3 >=0,
         @limit:=(@limit-(@qty3:=qty3)),
         @qty3:=@limit + LEAST(@limit, @limit:=0)),
      IF(@limit-qty4 >=0,
         @limit:=(@limit-(@qty4:=qty4)),
         @qty4:=@limit + LEAST(@limit, @limit:=0)),
      @limit), 0, 1)
;

【讨论】:

谢谢!它确实适用于样本数据。让我用我的真实数据测试一下,我会回答你关于赏金的问题......【参考方案4】:

让我们从问题中加载您的示例数据

mysql> drop database if exists javier;
Query OK, 1 row affected (0.02 sec)

mysql> create database javier;
Query OK, 1 row affected (0.01 sec)

mysql> use javier
Database changed
mysql> create table mytable
    -> (
    ->     id int not null auto_increment,
    ->     qty1 float,qty2 float,qty3 float,qty4 float,
    ->     primary key (id)
    -> );
Query OK, 0 rows affected (0.08 sec)

mysql> insert into mytable (qty1,qty2,qty3,qty4) values
    -> ( 0.0 , 0.0 , 10  , 20 ),( 1.5 , 0.0 , 7.5 , 18 ),
    -> ( 1.0 , 2.0 , 7.5 , 18 ),( 0.0 , 0.5 , 5   , 13 );
Query OK, 4 rows affected (0.05 sec)
Records: 4  Duplicates: 0  Warnings: 0

mysql> select * from mytable;
+----+------+------+------+------+
| id | qty1 | qty2 | qty3 | qty4 |
+----+------+------+------+------+
|  1 |    0 |    0 |   10 |   20 |
|  2 |  1.5 |    0 |  7.5 |   18 |
|  3 |    1 |    2 |  7.5 |   18 |
|  4 |    0 |  0.5 |    5 |   13 |
+----+------+------+------+------+
4 rows in set (0.00 sec)

mysql>

完全有效的最终查询

select BBBB.* from (select id,sums FROM (select A.id,A.sums from
(select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB 
where BB.id<=AA.id) sums from mytable AA order by id) A 
INNER JOIN (SELECT 50 mylimit) B ON A.sums <= B.mylimit) AAA
UNION
(select A.id,A.sums from (select id,(select sum(qty1+qty2+qty3+qty4)  
from mytable BB where BB.id<=AA.id) sums from mytable AA order by id) A
where A.sums=(select min(A.sums) sums from (select id, 
(select sum(qty1+qty2+qty3+qty4) from mytable BB where BB.id<=AA.id) sums 
from mytable AA order by id) A INNER JOIN (SELECT 50 mylimit) B 
ON A.sums >= B.mylimit))) AAAA JOIN mytable BBBB USING (id);

完全有效的最终查询补充

select BBBB.* from  mytable BBBB LEFT JOIN
(select id,sums FROM (select A.id,A.sums from ( 
select id,(select sum(qty1+qty2+qty3+qty4)  
from mytable BB where BB.id<=AA.id) sums 
from mytable AA order by id) A INNER JOIN
(SELECT 50 mylimit) B ON A.sums <= B.mylimit) AAA
UNION
(select A.id,A.sums from (select id, 
(select sum(qty1+qty2+qty3+qty4) from mytable BB 
where BB.id<=AA.id) sums from mytable AA order by id) A
where A.sums=(select min(A.sums) sums from ( 
select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB 
where BB.id<=AA.id) sums from mytable AA order by id) A 
INNER JOIN (SELECT 50 mylimit) B ON A.sums >= B.mylimit))) AAAA
USING (id) WHERE AAAA.id IS NULL;

这是 57 的输出

mysql>     select BBBB.* from (select id,sums FROM (select A.id,A.sums from
    ->     (select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     INNER JOIN (SELECT 57 mylimit) B ON A.sums <= B.mylimit) AAA
    ->     UNION
    ->     (select A.id,A.sums from (select id,(select sum(qty1+qty2+qty3+qty4)
    ->     from mytable BB where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     where A.sums=(select min(A.sums) sums from (select id,
    ->     (select sum(qty1+qty2+qty3+qty4) from mytable BB where BB.id<=AA.id) sums
    ->     from mytable AA order by id) A INNER JOIN (SELECT 57 mylimit) B
    ->     ON A.sums >= B.mylimit))) AAAA JOIN mytable BBBB USING (id);
+----+------+------+------+------+
| id | qty1 | qty2 | qty3 | qty4 |
+----+------+------+------+------+
|  1 |    0 |    0 |   10 |   20 |
|  2 |  1.5 |    0 |  7.5 |   18 |
+----+------+------+------+------+
2 rows in set (0.00 sec)

mysql>     select BBBB.* from  mytable BBBB LEFT JOIN
    ->     (select id,sums FROM (select A.id,A.sums from (
    ->     select id,(select sum(qty1+qty2+qty3+qty4)
    ->     from mytable BB where BB.id<=AA.id) sums
    ->     from mytable AA order by id) A INNER JOIN
    ->     (SELECT 57 mylimit) B ON A.sums <= B.mylimit) AAA
    ->     UNION
    ->     (select A.id,A.sums from (select id,
    ->     (select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     where A.sums=(select min(A.sums) sums from (
    ->     select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     INNER JOIN (SELECT 57 mylimit) B ON A.sums >= B.mylimit))) AAAA
    ->     USING (id) WHERE AAAA.id IS NULL;
+----+------+------+------+------+
| id | qty1 | qty2 | qty3 | qty4 |
+----+------+------+------+------+
|  3 |    1 |    2 |  7.5 |   18 |
|  4 |    0 |  0.5 |    5 |   13 |
+----+------+------+------+------+
2 rows in set (0.00 sec)

mysql>

这是 50 的输出

mysql>     select BBBB.* from (select id,sums FROM (select A.id,A.sums from
    ->     (select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     INNER JOIN (SELECT 50 mylimit) B ON A.sums <= B.mylimit) AAA
    ->     UNION
    ->     (select A.id,A.sums from (select id,(select sum(qty1+qty2+qty3+qty4)
    ->     from mytable BB where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     where A.sums=(select min(A.sums) sums from (select id,
    ->     (select sum(qty1+qty2+qty3+qty4) from mytable BB where BB.id<=AA.id) sums
    ->     from mytable AA order by id) A INNER JOIN (SELECT 50 mylimit) B
    ->     ON A.sums >= B.mylimit))) AAAA JOIN mytable BBBB USING (id);
+----+------+------+------+------+
| id | qty1 | qty2 | qty3 | qty4 |
+----+------+------+------+------+
|  1 |    0 |    0 |   10 |   20 |
|  2 |  1.5 |    0 |  7.5 |   18 |
+----+------+------+------+------+
2 rows in set (0.00 sec)

mysql>     select BBBB.* from  mytable BBBB LEFT JOIN
    ->     (select id,sums FROM (select A.id,A.sums from (
    ->     select id,(select sum(qty1+qty2+qty3+qty4)
    ->     from mytable BB where BB.id<=AA.id) sums
    ->     from mytable AA order by id) A INNER JOIN
    ->     (SELECT 50 mylimit) B ON A.sums <= B.mylimit) AAA
    ->     UNION
    ->     (select A.id,A.sums from (select id,
    ->     (select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     where A.sums=(select min(A.sums) sums from (
    ->     select id,(select sum(qty1+qty2+qty3+qty4) from mytable BB
    ->     where BB.id<=AA.id) sums from mytable AA order by id) A
    ->     INNER JOIN (SELECT 50 mylimit) B ON A.sums >= B.mylimit))) AAAA
    ->     USING (id) WHERE AAAA.id IS NULL;
+----+------+------+------+------+
| id | qty1 | qty2 | qty3 | qty4 |
+----+------+------+------+------+
|  3 |    1 |    2 |  7.5 |   18 |
|  4 |    0 |  0.5 |    5 |   13 |
+----+------+------+------+------+
2 rows in set (0.01 sec)

mysql>

请记住在 (SELECT 50 mylimit) 子查询中为 mylimit 设置数字,每次两次。

请告诉我有这个...

【讨论】:

谢谢 :) 但这只是解决了我提出的简单案例,而不是复杂案例;) @Javier 复杂的是什么?如果您的意思是使用其他数字,例如 50,只需在两个查询(最终和最终补码)中将 (SELECT 57 mylimit) 替换为 (SELECT 50 mylimit),它们将完美运行。否则,请解释复杂情况。 我将尝试通过示例来解释,使用我提供的示例数据。在 mylimit=50 的情况下,id=2 的查询结果应为:qty1=1.5, qty2=0.0, qty3=7.5, qty4=11。但是,您的查询没有给我 id=2 的行。可以看到,qty4=11 不是表的数据,而是每行的qty1+qty2+qty3+qty4的值相加,直到达到mylimit值的结果。也就是说,当您添加时达到 50: row1.qty1+row1.qty2+row1.qty3+row1.qty4 然后 +row2.qty1+row2.qty2+row2.qty3 然后 +7 从原始 row2.qty4 中减去=18,这给出了您在所需结果上看到的 7 嗨哈维尔,刚刚签到。请检查最终查询和最终查询补充。它们完全适用于 57 和 50。 我清理了我的答案,以便将工作查询放在首位。我包括了 57 和 50 被执行的完整示例。

以上是关于SQL 选择表中的某些行,以便它们总和为某个值的主要内容,如果未能解决你的问题,请参考以下文章

有没有办法根据 Python 中的一列或多列中具有相似值的行来选择表中的某些行?

SQL通过联结表从其他表中选择总和作为列

如何从 SQL 表中选择特定行并连接 SQL 服务器中的多个表?

如何根据总和值选择特定记录

需要 SQL 来选择行,直到列的总和达到最后一行不会完全消耗值的值

更新表中所有行的有效方法