A dataframe is a data structure that presents data in form of a table with rows and columns.

Dataframes present characteristics such as:

  • They are Two-dimensional

  • Size-mutable

  • Potentially heterogenous tabular data

These characteristics make them the most preferred when storing and working with data.

As a result, they are the building blocks for most data analytic processes.

DataFrames are similar to SQL tables or the spreadsheets that you work with in excel, however, there is a big difference between them. DataFrames are faster, and more powerful than Excel spreadsheets because they are an integral part of Python and NumPy ecosystems. They can also be used in other languages such as R, Julia, and big data systems like Spark.

Additional Resources: