Date | Name | Amount | Station | Line | Number |
---|---|---|---|---|---|
2021-11-01 | John | 11.2 | 1234 | 1 | 1 |
2021-11-02 | Alice | 13.4 | 1236 | 1 | 2 |
2021-11-01 | David | 12.8 | 1234 | 1 | 3 |
2021-11-01 | Jack | 9.7 | 1234 | 1 | 4 |
2021-11-02 | Zoey | 23.8 | 1236 | 1 | 5 |
2021-11-03 | Ada | 4.5 | 2565 | 1 | 6 |
请问例如上述表格,我需要找出所有Date,Station,Line组合起来相同的行,得到结果应该如下:
Date | Name | Amount | Station | Line | Number |
---|---|---|---|---|---|
2021-11-01 | John | 11.2 | 1234 | 1 | 1 |
2021-11-01 | David | 12.8 | 1234 | 1 | 3 |
2021-11-01 | Jack | 9.7 | 1234 | 1 | 4 |
2021-11-02 | Zoey | 23.8 | 1236 | 1 | 5 |
2021-11-02 | Alice | 13.4 | 1236 | 1 | 2 |
请问使用Spark sql应该怎样写?感谢!
?