Federico Posted September 2 Share Posted September 2 Good morning, We have installed Rulex Platform (version 1.2.2-191) on a server (the server is running on a virtual machine), and although the installation was successful, the program is extremely slow. For example, performing an action like a join operation takes about 7-8 minutes. Our machine has the following specifications: - CPU: Intel Xeon Gold 6240 @ 2.6 GHz - CPU virtual cores: 12 - RAM: 32 GB -OS: Windows Server 2022 Standard The dataset we are working on is quite large, with over a million records and more than 300 fields. In this regard, what would be the ideal specifications for a machine to efficiently handle this type of workload? With an optimal configuration, is it realistic to expect a significant reduction in the execution time of actions, or would the waiting time still be around 7-8 minutes? Additionally, the Rulex documentation mentions a recommended requirement of a CPU with 8 cores. Is it possible to have more precise requirements, for example regarding the CPU frequency or generation? Thank you, Federico Want to know more? Link to comment Share on other sites More sharing options...
Enrico Posted September 3 Share Posted September 3 Hi Federico, can you please share the rfl with the data in private chat with me in order to let us investigate further? Want to know more? Link to comment Share on other sites More sharing options...
Federico Posted September 3 Author Share Posted September 3 Hi Enrico, unfortunately sharing the file is problematic. Is it possible to have more specific requirements for a server installation, and ideally some sort of benchmark of the expected performance of the software? Thank you, Federico Want to know more? Link to comment Share on other sites More sharing options...
Enrico Posted September 5 Share Posted September 5 Hi, for the moment we don't have more specific prerequisites than the one that you can find on the guide of installation Installing Rulex Platform. Regarding the performance, have you had the chance to read the article that compare Rulex with Pandas? Additionally, something that can slow the join task is the presence of duplicate rows inside the key attributes. To check an eventual presence of duplicates you can change the default option "Duplicated keys policy" to Warning This option now will warn you at the end of the computation of the task, to improve the timing you can remove the duplicates before computing the join in a previous Data Manager. Let us know if you manage to improve your performance Want to know more? Link to comment Share on other sites More sharing options...
pisani Posted September 5 Share Posted September 5 per federico cominciamo a suddividere il problema in 2 parti: la quantità di dati e altri aspetti carica come dati (import da file Excel) i 2 file che Rita ti ha trasmesso recentemente per una analisi statistica e prova a fare una join su questi se la velocità non migliora vuol dire che c'è un problema di installazione, altrimenti il problema è legato alla mole di dati Want to know more? Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.