Recommended Datasets for Course Project

With your suggestion, I now get the following traceback

File “”, line 2
ple_2015=pd.read_csv(‘ple.csv’,engine=‘python3’)
^
SyntaxError: invalid character in identifier

With you suggestion, I get the following traceback

File “”, line 2
ple_2015=pd.read_csv(‘ple.csv’,encoding=“utf-8”)
^
SyntaxError: invalid character in identifier

For some reason the quotation marks in the code are not the standard ASCII quotation marks. Python is interpreting those as invalid characters.

I’m not sure which keyboard or language, you are using, but you need to make sure they are the following:

single quotation mark: '
double quotation mark: "

1 Like

I think this happens because someone is being lazy and copies directly from forum.

There must be some different encoding regarding to quotes or something :stuck_out_tongue:

1 Like

I’m having problems with downloading data from this link https://www.kaggle.com/lava18/google-play-store-apps. I both want to download the googleplaystore.csv and googleplaystore_user_reviews.csv, but i don’t know how to download data from kaggle. Is there anyway to import file from notebooks.

1 Like

Given a column with values as 10,000,100 or 1000+ how to convert these into and integer and put back into same column

You can use IO library.
import io

and replace file read code with
with io.open(file, 'r', encoding="utf-8") as raw_data:

It will work.

Can we use the dataset one or two years old, I mean up to 2018 or 2019? Or should we use the latest dataset?

as we learnt how to drop columns, is it possible to drop rows in a data frame? if yes how to?

This is how to drop row 1 in name_df

name_df.drop(1, axis=0)

Multiple rows

rows_list = [1,2,3,4,5] 
name_df.drop(rows_list, axis=0)

Hi we can pick the dataset from this Some interesting datasets right?
I have picked this dataset: Google Play Store Android Apps Data. My course project can be found here. Please let me know if I have to use a different dataset. Thanks!

Hello - my binder keeps failing to run:

  1. Created a new notebook myself and after trying to run… i keep getting these errors:
    “Sorry, https%3A%2F%2Fjovian.ai%2Fapi%2Fgit%2F3998a84f4ea04268ab733aa72d6d82a9_1.git/0b0acc728b73737bd4c2bc4bc6af256e55761597 has been temporarily disabled from launching. Please contact admins for more info!”

  2. I duplicated Aakash’s notebook and after i try to run below is the error message:
    Sorry, https%3A%2F%2Fjovian.ai%2Fapi%2Fgit%2F0ec1e098693a4e0ba763871445f52b12_1.git/c2f11e6e23ca0843aae3a851fd483fbbf71342d2 has been temporarily disabled from launching. Please contact admins for more info!

Can someone help? Thanks!

Same problem with me too.

can anyone help how i can choose dataset from project

They have made it easier now. Paste a Kaggle link in the dataset_url = ‘(paste link)’ Double check that the data you obtain is CSV

Can you help me in this …
And i am getting to enter the credentials. And i didn’t got this before …

Do you want to know the steps to download data from open datasets? I have already answered this on a earlier issue check this.

what is the" kaggle key" while downloading the dataset fro kaggle

Please follow the link printed in the message. It will lead you to the instructions given here: https://github.com/JovianML/opendatasets/blob/master/README.md#kaggle-credentials