Tired of writing complex Spark streaming jobs just to land data in your #DataLakehouse?
What if you could go from a Kafka topic to an #ApacheIceberg table with ZERO ETL code? I wrote a deep dive on how to build a code-free, real-time data pipeline. https://t.co/EqbAJHNPkZ
The blog includes a full, reproducible #Docker setup with Kafka Connect, Nessie, MinIO, and Trino so you can get hands-on experience immediately.
Ready to revolutionize your data ingestion strategy?
@dr_alphalyrae Science is a value; academia is an environment. While many may leave the latter, the pursuit of knowledge and discovery remains at the heart of those who cherish the former.
NEW §
SPAs often require a lot of code to be downloaded, which can delay a page's initial load. @JuntaoQiu's next pattern, Code Splitting, describes how this code can be divided up, so that modules are only loaded if they are going to be needed.
https://t.co/AeQpM3nVCh
Thrilled to be in NY this week for #Salesforcetour! Let our team of experts show you how #Data #AI & #ProcessMining can unlock value, boost productivity, reduce operational costs and amaze your customers. Visit us Thurs in the exhibit hall! #salesforce #worldtournyc
Diving deeper into #GenAI at #StartupDay with @marlon_dumas, understanding how the GenAI hype is not just a buzz but a catalyst across various AI domains like robotics, NLP, virtual assistants, and automated decision systems. A reality check on AI’s influence on tech evolution.
Looking for a passionate and motivated Master's graduate to join my research team to develop leading-edge methods for data-driven business process optimization, under a remunerated PhD fellowship - https://t.co/srERoCKktV
#PhDposition #processmining #bpm #processimprovement
@suvorau Amun provides high-utility anonymization. It achieves this using personalized differential privacy. Amun is also scalable: it can anonymize large event logs.
An empirical evaluation of Amun vs. PM4PY's approaches is presented in our IS paper. https://t.co/v134csHQla
@marlon_dumas @fp_stephan Performing the anonymization over a lossless log representation could also preserve more utility. You can then perform the replay over the representation. We've used a DAFSA because it's minimal, and sampling instead of replay. That provides high-utility anonymization.