.CSV -> .DB viel zu lange dauert

.CSV -> .DB viel zu lange dauert ⇐ Python

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Anonymous

Post by Anonymous » 19 May 2025, 07:59

Mein Code (Python) funktioniert, aber mit dem Rate, in dem er ausgeführt wird, dauert es 72 Stunden, bis er fertig ist. Ich versuche, eine CSV-Datei in eine Datenbankdatei zu verwandeln, in der Hoffnung, dass sie einfacher zu verwenden und auf die Daten zugreifen zu können (bevor ich 2 Minuten darauf warten, dass Python das CSV jedes Mal in eine Liste lädt, wenn ich einen Code mit ihm ausführte).

Code: Select all

from datetime import date

#works = "Out_Files\works-20210226.csv"
#tags = "Out_Files\tags-20210226.csv"

with sqlite3.connect("2025\works-20210226.db") as con:
cur = con.cursor()
cmd = "CREATE TABLE works (creation_date INT, language STRING, restricted INT, complete INT, word_count INT, tags STRING)"
cur.execute(cmd)
con.commit()
with open("Out_Files\works-20210226.csv", 'r', encoding='utf-8') as f:
i = 0
next = f.readline()
next = f.readline()
while next:
thedate,language,restricted,complete,word_count,tags = next.split(",")
thedate = [int(each) for each in thedate.split("-")]
thedate = date(thedate[0], thedate[1], thedate[2]).toordinal()
restricted = int(restricted == "true")
complete = int(complete == 'true')
word_count = int(word_count) if word_count else 0

cmd = f"INSERT INTO works VALUES ({thedate}, '{language}', {restricted}, {complete}, {word_count}, '{tags}')"
cur.execute(cmd)
con.commit()

next = f.readline()
i += 1
print(i)

cmd = "SELECT * FROM works WHEN word_count = 1836"
print(cur.execute(cmd))
< /code>
Ich bin derzeit bei i = 733,590 von 7,269,695 < /p>

Bearbeiten: Behoben! Vielen Dank an @mark tolonen < /p>
import sqlite3
from datetime import date
import pandas
import numpy
import sys
#import bs4

#works = "Out_Files\works-20210226.csv"
#tags = "Out_Files\tags-20210226.csv"

#sys.path.append("c:\users\sutli\appdata\local\packages\pythonsoftwarefoundation.python.3.10_qbz5n2kfra8p0\localcache\local-packages\python310\site-packages")

#print(sys.path)

def load():
with sqlite3.connect("2025\works-20210226.db") as con:
cur = con.cursor()
#cmd = "CREATE TABLE works (creation_date INT, language STRING, restricted INT, complete INT, word_count INT, tags STRING)"
#cur.execute(cmd)
con.commit()
#with open("Out_Files\works-20210226.csv", 'r', encoding='utf-8') as f:
df = pandas.read_csv("Out_Files\works-20210226.csv")

def fixDate(d):
d = [int(each) for each in d.split("-")]
d = date(d[0], d[1], d[2]).toordinal()

#def fixWords(w):
#    return int(w) if not pandas.notna(w) else 0

df['creation date'].apply(fixDate)
#df['word_count'].apply(fixWords)
df.astype({'restricted': 'bool', 'complete':'bool'})

df.to_sql("works", con)
con.commit()

cmd = "SELECT * FROM works WHEN word_count = 1836"
print(cur.execute(cmd))

1747634382

Anonymous

Mein Code (Python) funktioniert, aber mit dem Rate, in dem er ausgeführt wird, dauert es 72 Stunden, bis er fertig ist. Ich versuche, eine CSV-Datei in eine Datenbankdatei zu verwandeln, in der Hoffnung, dass sie einfacher zu verwenden und auf die Daten zugreifen zu können (bevor ich 2 Minuten darauf warten, dass Python das CSV jedes Mal in eine Liste lädt, wenn ich einen Code mit ihm ausführte).[code]from datetime import date

#works = "Out_Files\works-20210226.csv"
#tags = "Out_Files\tags-20210226.csv"

with sqlite3.connect("2025\works-20210226.db") as con:
cur = con.cursor()
cmd = "CREATE TABLE works (creation_date INT, language STRING, restricted INT, complete INT, word_count INT, tags STRING)"
cur.execute(cmd)
con.commit()
with open("Out_Files\works-20210226.csv", 'r', encoding='utf-8') as f:
i = 0
next = f.readline()
next = f.readline()
while next:
thedate,language,restricted,complete,word_count,tags = next.split(",")
thedate = [int(each) for each in thedate.split("-")]
thedate = date(thedate[0], thedate[1], thedate[2]).toordinal()
restricted = int(restricted == "true")
complete = int(complete == 'true')
word_count = int(word_count) if word_count else 0

cmd = f"INSERT INTO works VALUES ({thedate}, '{language}', {restricted}, {complete}, {word_count}, '{tags}')"
cur.execute(cmd)
con.commit()

next = f.readline()
i += 1
print(i)

cmd = "SELECT * FROM works WHEN word_count = 1836"
print(cur.execute(cmd))
< /code>
Ich bin derzeit bei i = 733,590 von 7,269,695 < /p>

Bearbeiten: Behoben! Vielen Dank an @mark tolonen < /p>
import sqlite3
from datetime import date
import pandas
import numpy
import sys
#import bs4

#works = "Out_Files\works-20210226.csv"
#tags = "Out_Files\tags-20210226.csv"

#sys.path.append("c:\users\sutli\appdata\local\packages\pythonsoftwarefoundation.python.3.10_qbz5n2kfra8p0\localcache\local-packages\python310\site-packages")

#print(sys.path)

def load():
with sqlite3.connect("2025\works-20210226.db") as con:
cur = con.cursor()
#cmd = "CREATE TABLE works (creation_date INT, language STRING, restricted INT, complete INT, word_count INT, tags STRING)"
#cur.execute(cmd)
con.commit()
#with open("Out_Files\works-20210226.csv", 'r', encoding='utf-8') as f:
df = pandas.read_csv("Out_Files\works-20210226.csv")

def fixDate(d):
d = [int(each) for each in d.split("-")]
d = date(d[0], d[1], d[2]).toordinal()

#def fixWords(w):
#    return int(w) if not pandas.notna(w) else 0

df['creation date'].apply(fixDate)
#df['word_count'].apply(fixWords)
df.astype({'restricted': 'bool', 'complete':'bool'})

df.to_sql("works", con)
con.commit()

cmd = "SELECT * FROM works WHEN word_count = 1836"
print(cur.execute(cmd))
[/code]

Post Reply Previous topic Next topic

1 post • Page 1 of 1

Quick Reply

Username:

Change Text Case:

Smilies

View more smilies

Similar Topics

Replies

Views

Last post

Der Docker-Build dauert zu lange, wenn Grpcio über PIP installiert wird

Last post by Guest « 07 Jan 2025, 13:17
Posted in Python

by Guest » 07 Jan 2025, 13:17 » in Python

Ich habe eine Docker-Datei, die einige Pakete über pip installiert.
Einige davon erfordern grpcio, und die Erstellung dieses Teils dauert nur wenige Minuten.
Hat jemand einen Tipp, um diesen Teil zu...

0 Replies

13 Views

Last post by Guest
07 Jan 2025, 13:17
Das Minimieren von Scipy.optimize dauert zu lange

Last post by Guest « 08 Jan 2025, 09:01
Posted in Python

by Guest » 08 Jan 2025, 09:01 » in Python

Ich führe ein eingeschränktes Optimierungsproblem mit etwa 1500 Variablen aus und die Ausführung dauert über 30 Minuten....

Wenn ich die Toleranz auf 1 reduziere Die Minimierung wird in etwa fünf...

0 Replies

14 Views

Last post by Guest
08 Jan 2025, 09:01
Das Hochladen des Flutter Firebase-Speichers dauert zu lange, aber nur unter IOS

Last post by Guest « 12 Jan 2025, 08:55
Posted in IOS

by Guest » 12 Jan 2025, 08:55 » in IOS

Ich arbeite an einer Upload-Funktion für meine Anwendung für Android und iOS und verwende dafür Firebase Storage. Ich lade kleine Bilder hoch, deren Größe kaum 100 KB überschreitet. Auf meinem...

0 Replies

16 Views

Last post by Guest
12 Jan 2025, 08:55
Warum dauert das Laden so lange Materialsymbole und wie kann ich es optimieren?

Last post by Guest « 25 Jan 2025, 14:35
Posted in HTML

by Guest » 25 Jan 2025, 14:35 » in HTML

Im Moment verwende ich Materialsymbole für meine Website. Es funktioniert großartig! Es ist hübsch, einfach zu bedienen und passt hervorragend zu meinem Design. Allerdings fällt mir auf, dass das...

0 Replies

16 Views

Last post by Guest
25 Jan 2025, 14:35
Das Debugging dauert so lange im VS -Code

Last post by Guest « 05 Feb 2025, 03:22
Posted in Python

by Guest » 05 Feb 2025, 03:22 » in Python

Ich drucke nur
print ( Hallo Welt !!! )
Aber das Debugging dauert so lange (wie 30 Sekunden) und ich weiß, dass dies nur eine Sekunde dauern sollte.
Was könnte das Problem sein?

0 Replies

17 Views

Last post by Guest
05 Feb 2025, 03:22

Return to “Python”