Should your Data Platform rely on Lambda or Kappa architecture?
Lambda vs Kappa Architektur for Data Lakehousing - Which one is the best choice for your Cloud Data Lakehouse?
Lambda vs Kappa Architektur for Data Lakehousing - Which one is the best choice for your Cloud Data Lakehouse?
Data Engineering is not just about enterprise-internal data but also about data gathering from external sources and data security.
Why data engineering has long since stolen the show from data science in terms of importance and career opportunities, but is itself subject to constant change. Data Engineer Job Profile…
There are occasions when one or more serverless functions are not sufficient to represent a service. For these cases, there is Google Cloud Run on the Google Cloud Platform. Cloud…
Process Mining is a method of data analysis to data-driven audit, monitor or analyze process flows. See our Infographic how it works.
A fuzzy matching was used to combine the data from the two different sources. A selection of fuzzy string-matching algorithms was tested, for example Jaro-Winkler Distance, Levenshtein distance, Soundex or cosine similarity. The open-source algorithms can be very efficient and there is a selection to choose from depending on the use case.
Every data scientist, data analyst or data engineer rarely works only with open data, but with internal company data that is of great importance for business success. All the more reason why these experts in data storage and analysis should always think about data security and observe certain rules and principles. In addition to the technical security of the data, legal security also plays a role.
Big data already plays a decisive role in almost all industries and has become an elementary earnings factor. Nevertheless, only a few companies have a mature data strategy in order…