How to do df.apply parallelly ?

barackobama · November 26, 2020, 10:21pm

I have a code segment

data_preprocess_segment_5 = df.apply(lambda x: ' '.join(segment(x)))

which takes a lot of time maybe more than 24hrs to process.
I was thinking to do

df[0:1000].apply(lambda x: ' '.join(segment(x)))
df[1000:2000].apply(lambda x: ' '.join(segment(x)))
df[2000:3000].apply(lambda x: ' '.join(segment(x)))

parallelly and later merge those df

pr0ph3t · December 17, 2020, 6:55pm

@Abhiram Shibu

Topic		Replies	Views
Is there a way to split files over multiple cores ? Linux	1	3122	December 6, 2022
Matplot lib or VisPy Programming machine-learning-and	1	2242	May 11, 2018
Two functions may give the same result or Does it??? Think before you Type!!! Programming	0	2654	June 28, 2017
Code not compiling on raspberry pi 4 but compiles fine on rasperry pi 3 B [Raspberry Pi] Electronics	1	2599	November 18, 2020
Is there any free vpn in python ? Programming	0	2590	August 14, 2018

How to do df.apply parallelly ?

Related topics