Filter articles by tags or search for specific topics:
Filter articles by tags or search for specific topics:
Original URL: https://medium.com/airbnb-engineering/from-data-to-insights-segmenting-airbnbs-supply-c88aa2bb9399
Added Date: November 26, 2024
Memo: Calculate new features for the user segmentation, and a good share for the validation.
Original URL: https://medium.com/booking-com-development/self-serve-platform-for-scalable-ml-recommendations-358caf217a2e
Added Date: November 26, 2024
Memo: Very flexible and scalable recommendation solution
Original URL: https://aws.amazon.com/blogs/big-data/introducing-generative-ai-troubleshooting-for-apache-spark-in-aws-glue-preview/
Added Date: November 25, 2024
Memo: Classic RAG solution for this kind of application.
Original URL: https://www.notion.so/blog/a-brief-history-of-notions-data-catalog
Added Date: November 24, 2024
Memo: Our team had the similar problem "Despite this integration’s technical success, we soon noticed that the new system was delivering lower-than-expected user engagement." Nice re-thinking about the improvement built!
Original URL: https://engineeringblog.yelp.com/2024/11/loading-data-into-redshift-with-dbt.html
Added Date: November 23, 2024
Memo: Why we load the S3 data into the Redshift again? it already queryable via Redshift Spectrum? I guess it's for the performance? Transform the S3 raw data, build the data models and write back into S3?
Original URL: https://www.harness.io/blog/from-dbt-to-sqlmesh
Added Date: November 21, 2024
Memo:
Original URL: https://medium.com/@nydas/4-key-benefits-of-shift-left-ff0e4bb74a3f
Added Date: November 20, 2024
Memo: This is one of big difference comparing to DE with SE.
Original URL: https://www.uber.com/en-GB/blog/presto-express/
Added Date: November 19, 2024
Memo:
Original URL: https://www.linkedin.com/blog/engineering/infrastructure/right-sizing-spark-executor-memory
Added Date: November 19, 2024
Memo:
Original URL: https://www.getdbt.com/resources/guides/the-analytics-development-lifecycle
Added Date: November 18, 2024
Memo: Fully adapt the SDLC practises for analytic world ...