Publications des agents du Cirad

Cirad

Pharo DataFrame: Past, Present, and Future

Safina L., Zaitsev O., Ferlicot-Delbecque C., Sow P.I.. 2023. In : Stéphane Ducasse (ed.), Gordana Rakic (ed.). Proceedings of the International Workshop on Smalltalk Technologies (IWST 2023) Vol 3627. Lyon : CEUR-WS, 11 p.. International Workshop on Smalltalk Technologies (IWST'2023), 2023-08-29/2023-08-31, Lyon (France).

DataFrame is a tabular data structure for data analysis. It is a two-dimensional table (similar to a spreadsheet) with an extensive API for querying and manipulating the data. Data frames are available in many programming languages (e.g., pandas in Python or data.frame in R), they are the go-to tools for data scientists and machine learning practitioners. Pharo DataFrame was first released in 2017. Since then, the library underwent many changes and improvements. In this paper, we present the Pharo DataFrame library, show examples of its usage, and compare its API to that of pandas. We overview the changes that have been made since DataFrame v1.0, discuss the limitations of the current implementation, and present the roadmap for future.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :