跳到主要内容

· 阅读需 7 分钟

A detailed blog post

already explains the ins and outs of why Scala users can now use the Flink Java API with any Scala version (including Scala 3).

In the end, removing Scala is just part of a larger effort of cleaning up and updating various technologies from the Flink ecosystem.

· 阅读需 5 分钟

准备表和数据

show full builtin  functions in test_db like 'year';
CREATE TABLE table1 
(
siteid INT DEFAULT '10',
citycode SMALLINT,
username VARCHAR(32) DEFAULT '',
pv BIGINT SUM DEFAULT '0'
)
AGGREGATE KEY(siteid, citycode, username)
DISTRIBUTED BY HASH(siteid) BUCKETS 10
PROPERTIES("replication_num" = "1");
insert into table1 values 
(1,1,'jim',2),
(2,1,'grace',2),
(3,2,'tom',2),
(4,3,'bush',3),
(5,3,'helen',3);

· 阅读需 1 分钟

Hbase 存储和查询维度数据

  • Hbase 数据分析效率低 分析函数少,Hbase数据分析涉及磁盘读取数据,比较版本,聚合导致效率低下

ClickHouse 存储和查询宽表数据

  • 更擅长数据分析而不是查询明细,维度数据不宜存放 ClickHouse