jacklangerman
/

my_cool_submission_2025

jacklangerman commited on Apr 4

Commit

19f4382

verified ·

1 Parent(s): 979fca8

Upload folder using huggingface_hub

Files changed (2) hide show

README.md CHANGED Viewed

@@ -1,4 +1,14 @@
-# My Cool Submission 2025
-This repo contains a submission to the [S23DR Challenge](https://huggingface.co/spaces/usm3d/S23DR) (part of the [USM3D](https://usm3d.github.io/) workshop at CVPR2025). It was prepared by [jacklangerman](https://huggingface.co/jacklangerman).

+# Empty solution example for the S23DR competition
+This repo provides a minimalistic example of a valid, but empty submission to S23DR competition.
+We recommend you take a look at [this example](https://huggingface.co/usm3d/handcrafted_baseline_submission),
+which implements some primitive algorithms and provides useful I/O and visualization functions.
+This example seeks to simply provide minimal code which succeeds at reading the dataset and producing a solution (in this case two vertices at the origin and edge of zero length connecting them).
+`script.py` - is the main file which is run by the competition space. It should produce `submission.parquet` as the result of the run. Please see the additional comments in the `script.py` file.
+---
+license: apache-2.0
+---

script.py CHANGED Viewed

@@ -56,18 +56,30 @@ if __name__ == "__main__":
         data_path = data_path_local
     print(data_path)
-    print([str(p) for p in data_path.rglob('*validation*.arrow')])
     # dataset = load_dataset(params['dataset'], trust_remote_code=True, use_auth_token=params['token'])
-    dataset = load_dataset(
-        "arrow",
-        data_files={
-            "validation": [str(p) for p in data_path.rglob('*validation*.arrow')],
-            "test": [str(p) for p in data_path.rglob('*test*.arrow')],
-        },
-        trust_remote_code=True,
-        # streaming=True
-    )
     print(dataset, flush=True)
     # dataset = load_dataset('webdataset', data_files={)

         data_path = data_path_local
     print(data_path)
+    print([str(p) for p in data_path.rglob('*validation*.(arrow|tar)')])
     # dataset = load_dataset(params['dataset'], trust_remote_code=True, use_auth_token=params['token'])
+    data_files = {
+        "validation": [str(p) for p in [*data_path.rglob('*validation*.arrow')]+[*data_path.rglob('*validation*.tar')]],
+        "test": [str(p) for p in [*data_path.rglob('*test*.arrow')]+[*data_path.rglob('*test*.tar')]],
+    }
+    try:
+        dataset = load_dataset(
+            "arrow",
+            data_files=data_files,
+            trust_remote_code=True,
+            # streaming=True
+        )
+        print('load with arrow')
+    except:
+        dataset = load_dataset(
+            "webdataset",
+            data_files=data_files,
+            trust_remote_code=True,
+            # streaming=True
+        )
+        print('load with webdataset')
     print(dataset, flush=True)
     # dataset = load_dataset('webdataset', data_files={)