DataEngineering 80
- Why ZooKeeper Was Replaced by KRaft Apr 25, 2022
- Flink Architecture Mar 10, 2022
- Debezium Feb 10, 2022
- Exactly Once in the S3 Connector Jan 27, 2022
- HBase Overview Jan 23, 2022
- [Snowflake] Queries Jan 14, 2022
- How Kafka Works Jan 14, 2022
- MSK Jan 14, 2022
- HPA by Kafka consumerlag Jan 5, 2022
- The Role of Keys in Kafka Jan 5, 2022
- Kafka Rebalancing Jan 5, 2022
- Local Kafka Setup Dec 19, 2021
- Kafka Transaction Dec 19, 2021
- MERGE Dec 9, 2021
- Fernet Key Dec 8, 2021
- Trouble shooting log for snowflake Nov 14, 2021
- External table Nov 14, 2021
- Trouble shooting log(Airflow) Oct 17, 2021
- Day1 Oct 2, 2021
- Adding jar or package to Spylon Kernel Aug 24, 2021
- Trouble Shooting(Spark) Aug 23, 2021
- Kafka Command Apr 21, 2021
- Big Data 3V Mar 26, 2021
- Data warehouse, Data Lake Mar 26, 2021
- Apache Arrow Mar 23, 2021
- PySpark UDF Mar 23, 2021
- Parquet Mar 23, 2021
- DataFrame Mar 16, 2021
- Shuffle Mar 14, 2021
- Transformation,Action Mar 13, 2021
- Log Compaction Mar 13, 2021
- How HDFS Append Works Mar 12, 2021
- Python to RDD communications Mar 11, 2021
- Data Processing Architecture Mar 10, 2021
- Why Use a Message Queue Mar 8, 2021
- Kinesis Mar 7, 2021
- Accumulo Mar 7, 2021
- Apache Flink vs Apache Spark Mar 5, 2021
- Spark vs MapReduce Feb 25, 2021
- Spark Architecture Feb 25, 2021
- Deciding Executor Resources Feb 25, 2021
- Schema Registry Feb 25, 2021
- Cache,Persist Feb 23, 2021
- Spark Memory Architecture Feb 13, 2021
- [Trouble Shooting] cannot resolve column(numeric column name) in Spark Dataframe Feb 10, 2021
- Flattening a Nested Schema Feb 10, 2021
- Avro vs Parquet Feb 2, 2021
- RDDs vs DataFrames vs Datasets Feb 2, 2021
- HDFS Connector 2 Jan 30, 2021
- Hive Internal table, External table Jan 24, 2021
- Kafka Streams Jan 20, 2021
- SCDF Jan 12, 2021
- Flume Jan 3, 2021
- Systems Used in the Past Jan 3, 2021
- Hue Jan 3, 2021
- Zeppelin Jan 3, 2021
- Oozie Dec 27, 2020
- Zookeeper Dec 27, 2020
- CDH vs HDP Dec 27, 2020
- Hadoop Query Engine Dec 26, 2020
- Kafka Configuration Summary Dec 26, 2020
- Kafka API Summary Dec 26, 2020
- StreamSQL Dec 21, 2020
- OLTP,OLAP Dec 17, 2020
- Hive and Impala Dec 16, 2020
- Apache Pig Dec 16, 2020
- Trouble Shooting(Hadoop) Dec 15, 2020
- MapReduce Dec 15, 2020
- HDFS Dec 15, 2020
- Hadoop Overview Dec 15, 2020
- Kafka Stream vs Spark Streaming Dec 12, 2020
- Batch Stream vs Stream Processing Dec 12, 2020
- Apache Avro Dec 12, 2020
- Kafka Trouble Shooting Dec 4, 2020
- Segment and Log Compaction Policy Dec 3, 2020
- ETL Pipeline Dec 1, 2020
- Apache Kafka Pros and Cons Nov 30, 2020
- [ROAD TO DATA ENGINEER] Kafka Basic Concepts Oct 6, 2020
- Getting Started with a Spark and Data Engineering Project Sep 24, 2020
- Apache Storm Sep 22, 2020