Skip to content
jd-anjinlong edited this page Jan 17, 2023 · 2 revisions

Welcome to the rtf-lake wiki!

JD RTF real-time data lake is a system rebuilt from the bottom, which solves the ETL process of data access, analysis and cleaning, and also solves the real-time performance that cannot be achieved by traditional offline mode and the data cleaning and restoration that cannot be achieved by streaming real-time data. It is a set of innovative real-time data solutions in the field of big data. RTF can directly query the latest status of data without duplication. It allows data analysts to obtain real-time data for analysis even if they do not understand real-time computing frameworks such as Flink or Spark.

Team Profile:

Core architect Liu Yehui, from JD Big Data Department

Core engineer An Jinlong, from JD Big Data Department

Core engineer Chen Jianfei, from JD Big Data Department

京东RTF实时数据湖,是一个从底层重新构建的系统,解决了数据的接入、解析及清洗等ETL 过程,同时解决了传统离线模式达不到的实时性和流式实时数据做不到的数据清洗、还原,是一套大数据领域改革性的实时数据方案。RTF可以直接查询最新状态的数据,并且无需去重,可以让数据分析人员即使不了解flink或spark等实时计算框架,也能够获取实时数据进行分析。

团队简介:

核心架构师 刘业辉 ,来自京东大数据部

核心工程师 安金龙 ,来自京东大数据部

核心工程师 陈建霏 ,来自京东大数据部

Clone this wiki locally