Spaces: Callmebowoo-22/AI-Supply-chain-Beta
Callmebowoo-22 committed on Apr 18

Commit dfa098d (verified)

1 Parent(s): 79636b6

Create preprocessing.py

Files changed (1)

utils/preprocessing.py ADDED

+import pandas as pd
+from sklearn.ensemble import IsolationForest
+
+
+def clean_data(file):
+    """
+    Remove anomalies (outliers) from UMKM (small-business) data.
+    Example input: a CSV file with the columns tanggal, demand, supply.
+    Returns a DataFrame containing only the rows classified as inliers.
+    """
+    # Read the data
+    df = pd.read_csv(file)
+    # Convert the date column (tanggal) to datetime
+    df['tanggal'] = pd.to_datetime(df['tanggal'])
+    # Detect anomalies: fit_predict returns 1 for inliers and -1 for outliers;
+    # contamination=0.05 assumes roughly 5% of the rows are outliers
+    clf = IsolationForest(contamination=0.05, random_state=42)
+    df['anomaly'] = clf.fit_predict(df[['demand', 'supply']])
+    # Keep only the inlier rows and drop the helper column
+    clean_df = df[df['anomaly'] == 1].copy()
+    clean_df.drop('anomaly', axis=1, inplace=True)
+    return clean_df
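
A minimal usage sketch for the function added in this commit, assuming pandas, NumPy, and scikit-learn are installed. The synthetic data, the in-memory CSV buffer, and the injected outlier at row 10 are illustrative assumptions and not part of the repository. `clean_data` is repeated here only so the example runs on its own.

```python
import io

import numpy as np
import pandas as pd
from sklearn.ensemble import IsolationForest


def clean_data(file):
    """Same logic as utils/preprocessing.py, repeated to keep this sketch self-contained."""
    df = pd.read_csv(file)
    df['tanggal'] = pd.to_datetime(df['tanggal'])
    clf = IsolationForest(contamination=0.05, random_state=42)
    # fit_predict labels inliers as 1 and outliers as -1
    df['anomaly'] = clf.fit_predict(df[['demand', 'supply']])
    clean_df = df[df['anomaly'] == 1].copy()
    clean_df.drop('anomaly', axis=1, inplace=True)
    return clean_df


# Hypothetical sample: 100 days of demand/supply values around 100 and 110
rng = np.random.default_rng(0)
n = 100
sample = pd.DataFrame({
    'tanggal': pd.date_range('2024-01-01', periods=n, freq='D'),
    'demand': rng.normal(100, 5, n).round(2),
    'supply': rng.normal(110, 5, n).round(2),
})
# Inject one obvious outlier (assumed values, for illustration only)
sample.loc[10, ['demand', 'supply']] = [10_000.0, 5.0]

# pd.read_csv accepts file-like objects, so an in-memory buffer stands in for a CSV file
csv_buffer = io.StringIO(sample.to_csv(index=False))
cleaned = clean_data(csv_buffer)

print(f"rows before: {len(sample)}, rows after: {len(cleaned)}")
print(cleaned.head())
```

Because `contamination=0.05` is fixed, roughly 5% of the rows are flagged even when the data contains no real outliers, so the threshold may need tuning per dataset.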