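[Exercise] NumPy vs. Pandas: File Reading Speed Comparison

This exercise reads a ratings file with NumPy and with Pandas, then compares how long each one takes:

Dataset: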

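import time

import numpy as np
import pandas as pd

# Time NumPy: genfromtxt parses the comma-separated file into a float array.
# perf_counter() is a monotonic clock intended for measuring elapsed time.
start_time = time.perf_counter()
data = np.genfromtxt('./rating.txt', delimiter=',')
end_reading_time = time.perf_counter()
print('Numpy reading time: {}ms'.format(round((end_reading_time - start_time) * 1000, 2)))

# Time Pandas: read_csv splits on commas by default (same as read_table with sep=',').
# Note: this reads rating.csv while the NumPy run reads rating.txt;
# both files should hold the same data for the comparison to be fair.
start_time = time.perf_counter()
data = pd.read_csv('./rating.csv',
    names=['user_id', 'book_id', 'rating'])
end_reading_time = time.perf_counter()
print('Pandas reading time: {}ms'.format(round((end_reading_time - start_time) * 1000, 2)))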

# Output
Numpy reading time: 27029.64ms
Pandas reading time: 1003.31ms
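
The gap in these numbers is plausible: `np.genfromtxt` parses the file line by line in Python, while `pd.read_csv` uses a parser written in C by default. To try the comparison without the original dataset, here is a minimal sketch that writes a small synthetic ratings file and times both readers on it. The file name `rating_sample.csv`, the row count, the value ranges, and the helper `time_call` are all assumptions made for illustration and are not part of the original exercise; actual timings depend on the machine and on file size.

```python
import os
import tempfile
import time

import numpy as np
import pandas as pd


def time_call(func, *args, **kwargs):
    """Hypothetical helper: return (result, elapsed milliseconds) for one call."""
    start = time.perf_counter()
    result = func(*args, **kwargs)
    elapsed_ms = (time.perf_counter() - start) * 1000
    return result, elapsed_ms


# Assumed stand-in data: 20,000 rows of (user_id, book_id, rating).
rng = np.random.default_rng(0)
n_rows = 20_000
sample = np.column_stack([
    rng.integers(1, 1_000, n_rows),   # user_id
    rng.integers(1, 5_000, n_rows),   # book_id
    rng.integers(1, 6, n_rows),       # rating in 1..5
])

with tempfile.TemporaryDirectory() as tmp_dir:
    # Both readers get the same file, so the comparison is like for like.
    path = os.path.join(tmp_dir, 'rating_sample.csv')
    np.savetxt(path, sample, fmt='%d', delimiter=',')

    numpy_data, numpy_ms = time_call(np.genfromtxt, path, delimiter=',')
    pandas_data, pandas_ms = time_call(
        pd.read_csv, path, names=['user_id', 'book_id', 'rating'])

print('Numpy reading time: {}ms'.format(round(numpy_ms, 2)))
print('Pandas reading time: {}ms'.format(round(pandas_ms, 2)))

# Sanity check: both readers should have parsed the same values.
same_values = np.array_equal(numpy_data, pandas_data.to_numpy())
print('Same values:', same_values)
```

A single run is noisy, so repeating each read a few times and taking the minimum gives a steadier picture than one measurement.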