Translating code to make scripts more efficient in a production environment
In this article I explain the scenario I have been working on for the past year: translating code from Pandas to PySpark to improve performance and make scripts more efficient in a production environment. Let’s start by understanding these technologies....
How to measure Data Democratization
Data democratization is the process of making data available and accessible to a wider audience, rather than limiting access to a select group of individuals or departments within an organization. This concept seeks to empower more people within...
Data Privacy: the importance of securely generating data in your business
In the digital age, where information is power, data privacy is imperative for every company. Protecting personal, behavioural and business data is no longer just a legal issue, but an ethical responsibility and a...