Combine Dataframe rows on conditions


Tag : python , By : kiirpi
Date : November 28 2020, 11:01 PM

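The idea is to merge overlapping date ranges within each ID_1:
1. sort_values so the rows are ordered by START_D, then STOP_D.
2. groupby('ID_1') so each ID is handled separately.
3. Track STOP_D.cummax() and check whether START_D is greater than the prior (shifted) cummax. If it is, the row does not overlap anything before it and starts a new range.
4. cumsum that flag to generate the groupings.
5. agg to grab the min START_D and max STOP_D of each grouping.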
df_example = df.sort_values(['START_D', 'STOP_D'])

def collapse(df):
    s, e = 'START_D', 'STOP_D'
    grps = df[s].gt(df[e].cummax().shift()).cumsum()
    funcs = {s: 'min', e: 'max', 'ID_1': 'first'}
    return df.groupby(grps).agg(funcs)

df_example.groupby('ID_1').apply(collapse).reset_index(drop=True)
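To see the grouping logic at work, here is a small self-contained sketch. The sample data is hypothetical (only the column names come from the answer), and it loops over the groups instead of calling apply, which is a design choice made here to keep the ID_1 handling explicit rather than the answer's own approach.

```python
import pandas as pd

# Hypothetical sample data; only the column names are taken from the answer.
df = pd.DataFrame({
    'ID_1': [1, 1, 1, 2, 2],
    'START_D': pd.to_datetime(['2020-01-01', '2020-01-05', '2020-02-01',
                               '2020-01-01', '2020-01-20']),
    'STOP_D': pd.to_datetime(['2020-01-10', '2020-01-15', '2020-02-10',
                              '2020-01-05', '2020-01-25']),
})

def collapse(g):
    s, e = 'START_D', 'STOP_D'
    # A row starts a new range when it begins after every earlier row has ended.
    grps = g[s].gt(g[e].cummax().shift()).cumsum()
    return g.groupby(grps).agg({s: 'min', e: 'max'})

parts = []
for id_1, g in df.sort_values(['START_D', 'STOP_D']).groupby('ID_1'):
    merged = collapse(g)
    merged.insert(0, 'ID_1', id_1)  # put the group key back as a column
    parts.append(merged)

result = pd.concat(parts, ignore_index=True)
print(result)
```

With this data the first two rows of ID 1 overlap and collapse into one range, while every other row stays on its own.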


How to subset a dataframe based on some conditions over the rows of dataframe that contains list in r


Tag : r , By : Nic Doye
Date : March 29 2020, 07:55 AM
I have a data frame in which each row of the Value column holds a list of values. I want to check a condition on those values and subset the data frame accordingly; in the code below, myDF is my data frame. Is this what you want? It keeps, for each row, only the elements of Value that also appear in vec:
myDF$Value_filtered <- lapply(myDF$Value, function(x) x[x %in% vec])

Combine 1-row dataframe and n-rows dataframe, by duplicating the 1-row and append to n-rows dataframe


Tag : python , By : francisco santos
Date : March 29 2020, 07:55 AM
Use a cross join built from merge + assign + drop. This solution also works if you want to combine multiple rows in d1:
df = pd.merge(d1.assign(A=1), d2.assign(A=1), on='A').drop(columns='A')
print (df)
    Name  Age  Subject Grade
0  Alice   18  Science     A
1  Alice   18     Math     C
2  Alice   18  English     B
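Alternatively, when d1 has a single row, reindex it to d2's index with forward fill and concatenate the two column-wise:
df = pd.concat([d1.reindex(d2.index, method='ffill'), d2], axis=1)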
print (df)
    Name  Age  Subject Grade
0  Alice   18  Science     A
1  Alice   18     Math     C
2  Alice   18  English     B
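On pandas 1.2 or later, the same cross join can be written without the helper column by passing how='cross' to merge. The sketch below assumes d1 and d2 look like the printed output above; the frames themselves are reconstructed for illustration and are not part of the original answer.

```python
import pandas as pd

# Frames reconstructed from the printed output above (an assumption).
d1 = pd.DataFrame({'Name': ['Alice'], 'Age': [18]})
d2 = pd.DataFrame({'Subject': ['Science', 'Math', 'English'],
                   'Grade': ['A', 'C', 'B']})

# how='cross' pairs every row of d1 with every row of d2 (pandas >= 1.2).
df = d1.merge(d2, how='cross')
print(df)
```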

Pandas DataFrame: programmatic rows split of a dataframe on multiple columns conditions


Tag : python , By : Rob
Date : October 18 2020, 08:10 AM
After several attempts, I managed to achieve my goal.
Here is the code:
import pandas
import numpy
# assume dataframe exists
df = ...
# list_cols (the names of the columns to test) is assumed to exist as well
# initialize a 1-D array of False, matching df's number of rows
resulting_bools = numpy.zeros(len(df.index), dtype=bool)

for col in list_cols:
    # obtain an array of booleans for the given column: True where the value is negative
    criterion = (df[col] < 0).to_numpy()  # same condition for each column, different conditions would have been more difficult (for me)

    # accumulate the results across columns with a logical OR
    resulting_bools |= criterion

# use the array of booleans to build the required df
negative_values_matches = df[resulting_bools].copy()  # use .copy() to avoid further possible warnings from pandas depending on what you do with your data frame
positive_values_matches = df[~resulting_bools].copy()
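When the condition is the same for every column, the loop can probably be replaced by a single vectorized comparison followed by any(axis=1). This is a sketch of that alternative, not the author's code; the sample frame and the contents of list_cols are hypothetical.

```python
import pandas as pd

# Hypothetical data; column 'c' is deliberately left out of list_cols.
df = pd.DataFrame({'a': [1, -2, 3, 4],
                   'b': [5, 6, -7, 8],
                   'c': [-1, -1, -1, -1]})
list_cols = ['a', 'b']

# True for rows where at least one of the selected columns is negative.
mask = (df[list_cols] < 0).any(axis=1)

negative_values_matches = df[mask].copy()
positive_values_matches = df[~mask].copy()
print(negative_values_matches)
print(positive_values_matches)
```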

Delete rows of Dataframe based on multiple conditions from different Dataframe


Tag : python , By : unadopted
Date : March 29 2020, 07:55 AM
df1.loc[(df1['date']==dayA) & (df1['location']==placeA)] is the dataframe consisting of the rows where the date and location match. drop expects the index labels of those rows, so you need df1.loc[(df1['date']==dayA) & (df1['location']==placeA)].index. However, this is a very inefficient method. You can use merge instead, as the other answers discuss. Another method would be df1 = df1.loc[~df1[['date','location']].apply(tuple,axis=1).isin(zip(df2.date,df2.location))].
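One way the merge route could look is an anti-join: left-merge df1 with the key columns of df2 using indicator=True, then keep only the rows marked left_only. The sample frames and the value column below are hypothetical; only the date and location column names come from the answer.

```python
import pandas as pd

# Hypothetical data; 'date' and 'location' are the key columns from the answer.
df1 = pd.DataFrame({'date': ['2020-01-01', '2020-01-01', '2020-01-02'],
                    'location': ['A', 'B', 'A'],
                    'value': [1, 2, 3]})
df2 = pd.DataFrame({'date': ['2020-01-01'], 'location': ['B']})

keys = ['date', 'location']
# indicator=True adds a '_merge' column saying where each row was found.
merged = df1.merge(df2[keys].drop_duplicates(), on=keys,
                   how='left', indicator=True)

# Keep the rows of df1 that have no matching (date, location) pair in df2.
result = merged[merged['_merge'] == 'left_only'].drop(columns='_merge')
print(result)
```

Note that merge returns a fresh integer index, so reattach the original index if df1's labels matter.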

How to extract rows of a pandas dataframe according to conditions based on another dataframe


Tag : python , By : dbarbot
Date : March 29 2020, 07:55 AM
Try this, which keeps the rows of df1 whose Points value appears among the rows of df2 where Sum is between 0 and 2 (inclusive):
df1[df1['Points'].isin(df2.query('0 <= Sum <= 2')['Points'])]
  Points  ColX
4       5     5
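The question's dataframes are not shown here, so the following sketch uses made-up frames chosen only to reproduce a result of the same shape. It also shows an equivalent filter written with between instead of query.

```python
import pandas as pd

# Hypothetical frames; the originals from the question are not available.
df1 = pd.DataFrame({'Points': [1, 2, 3, 4, 5], 'ColX': [1, 2, 3, 4, 5]})
df2 = pd.DataFrame({'Points': [1, 2, 3, 4, 5], 'Sum': [7, 5, 3, 9, 1]})

# Points values whose Sum in df2 lies in [0, 2].
wanted = df2.query('0 <= Sum <= 2')['Points']
selected = df1[df1['Points'].isin(wanted)]

# Same filter without query: between() is inclusive on both ends by default.
selected_alt = df1[df1['Points'].isin(df2.loc[df2['Sum'].between(0, 2), 'Points'])]
print(selected)
```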