我正在研究文本挖掘问题并使用 Pandas 进行文本处理。从以下示例中,我只需要选择在同一类别 ( ) 中具有最大跨度 ( start- end) 的那些行cat鉴于此数据框: name start end cat0 coumadin 0 8 DRUG1 albuterol 18 27 DRUG2 albuterol sulfate 18 35 DRUG3 sulfate 28 35 DRUG4 2.5 36 39 STRENGTH5 2.5 mg 36 42 STRENGTH6 2.5 mg /3 ml 36 48 STRENGTH7 0.083 50 55 STRENGTH8 0.083 % 50 57 STRENGTH9 2.5 mg /3 ml (0.083 %) 36 58 STRENGTH10 solution 59 67 FORM11 solution for nebulization 59 84 FORM12 nebulization 72 84 ROUTE13 one (1) 90 97 FREQUENCY14 neb 98 101 ROUTE15 neb inhalation 98 112 ROUTE16 inhalation 102 112 ROUTE17 q4h 113 116 FREQUENCY18 every 118 123 FREQUENCY19 every 4 hours 118 131 FREQUENCY20 q4h (every 4 hours) 113 132 FREQUENCY21 q4h (every 4 hours) as needed 113 142 FREQUENCY22 as needed 133 142 FREQUENCY23 dyspnea 147 154 REASON
添加回答
举报
0/150
提交
取消