Spaces:

awacke1
/

VoiceGPT15

Sleeping

awacke1 commited on Jul 7, 2023

Commit

cad979c

1 Parent(s): e241a76

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -226,7 +226,7 @@ def pdf2txt(pdf_docs):
         # You need to replace the following lines with actual file reading
         # based on the file_extension
         if file_extension in ['txt', 'html', 'htm', 'py', 'xml', 'json']:
-            text += textract.process(str(file).decode("utf-8") )
             text += f"\nExtracted text from {file_extension} file..."
         elif file_extension == 'pdf':
             pdf_reader = PdfReader(file)

         # You need to replace the following lines with actual file reading
         # based on the file_extension
         if file_extension in ['txt', 'html', 'htm', 'py', 'xml', 'json']:
+            text += textract.process(str(file))
             text += f"\nExtracted text from {file_extension} file..."
         elif file_extension == 'pdf':
             pdf_reader = PdfReader(file)