Skip to content
#

csv-processing

Here are 27 public repositories matching this topic...

Pipeline-Genie is an intelligent data pipeline that processes CSV datasets, identifies their schema, and leverages LLaMA 2.0 to extract business insights. Users can select relevant business needs, triggering automated ETL transformations using Apache Spark. The final transformed dataset is stored in AWS S3 and made available for download.

  • Updated Feb 21, 2025
  • Python

In this project, I analyze commercial sales data using NumPy and pandas. I visualize total revenue per product using color-coded bar charts in Matplotlib. It’s a foundational step in business data analysis and project documentation.

  • Updated Jul 21, 2025
  • Python

Python-ΠΏΡ€ΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΠ΅ для сопоставлСния Π½ΠΎΠΌΠ΅Ρ€ΠΎΠ² ΠΈΠ· Π²Ρ‹Π³Ρ€ΡƒΠ·ΠΊΠΈ Active Directory с Π΄Π°Π½Π½Ρ‹ΠΌΠΈ ΠΈΠ· .csv/.txt, с Π²Ρ‹Π²ΠΎΠ΄ΠΎΠΌ Π² CSV ΠΈ Π»ΠΎΠ³ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ.

  • Updated Apr 30, 2025
  • Python

A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.

  • Updated Feb 7, 2025
  • Python

This repository showcases a complete Python-based ETL (Extract, Transform, Load) data pipeline designed to process, validate, and analyze weather data for multiple cities. The project demonstrates a structured approach to handling weather data, focusing on data accuracy, transformation, and insights generation.

  • Updated Jul 30, 2025
  • Python

The 🐲EMU RPG API🐲 supports the EMU RPG Club’s events by managing game tables, players, and D&D character data. Built with FastAPI, it includes features like table/character management, real-time WebSocket updates, data validation, API monitoring, and secure access, providing an organized backend for tabletop RPG sessions.

  • Updated Mar 25, 2025
  • Python

A Streamlit app that uses OpenAI's LLM for natural language data analysis. Upload CSV files, ask questions in plain English, and get instant insights. Powered by PandasAI, it's designed for quick, code-free exploration of structured data.

  • Updated Apr 22, 2025
  • Python

This Project aims to implement a **Hadoop MapReduce job in Pseudo-Distributed Mode** to determine the **feistiest PokΓ©mon** based on their **type**. The job processes the PokΓ©mon dataset (`pokemon.csv`) and outputs a CSV file containing PokΓ©mon **type1, type2, name, and feistiness score**.

  • Updated May 8, 2025
  • Python

Improve this page

Add a description, image, and links to the csv-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the csv-processing topic, visit your repo's landing page and select "manage topics."

Learn more